the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.

Decadal re-forecasts of glacier climatic mass balance
Larissa Nora van der Laan
Anouk Vlug
Adam A. Scaife
Fabien Maussion
Kristian Förster
We present the first study employing decadal re-forecasts to simulate global glacier climatic mass balance, bridging the gap between seasonal forecasts and long-term projections of glacier contributions to catchment hydrology and sea-level rise. Using the Open Global Glacier Model (OGGM) and Coupled Model Intercomparison Project Phase 6 (CMIP6) decadal re-forecasts of temperature and precipitation, we demonstrate the predictive skill of glacier mass balance re-forecasts over decadal timescales in two components: for a set of 279 reference glaciers, making use of their mass balance record, and all land-terminating glaciers, making use of the globally available geodetic mass balance, respectively. Results show that forcing OGGM with decadal re-forecasts outperforms persistence forecasts and historical general circulation model (GCM) simulations. Specifically, out of 279 reference glaciers, 174 show improved skill when forcing OGGM with decadal re-forecasts for decadal mean mass balance and 186 show improved skill for cumulative mass balance. On a global scale, forcing with decadal re-forecasts yields the best agreement with observed regional mean mass balances for the period 2000–2020. These findings demonstrate moderate improvements from using decadal re-forecasts, though statistical significance is limited. While improvements are modest, the results suggest decadal re-forecasts may offer potential for improved near-term glacier predictions relevant to hydrological applications, particularly in regions where near-term forecasts can inform water resource management and climate adaptation strategies.
- Article
(1954 KB) - Full-text XML
- BibTeX
- EndNote
As unique indicators of climate change, water storage reservoirs and culturally significant sites, glaciers serve a multitude of purposes (Allison, 2015; Bosson et al., 2019; Farinotti et al., 2020; Jansson et al., 2003). Observing and simulating their response to climate change on various timescales, from millennia to a focus on the past century, comprise an essential and continuously developing field of study (e.g., Goosse et al., 2018; Hock et al., 2019; Malles and Marzeion, 2021; Marzeion et al., 2017; Roe et al., 2021; Vargo et al., 2020). While the storage outside the Greenland and Antarctic ice sheets only constitutes a small percentage of total global freshwater storage in ice, it is equivalent to approximately 0.32 m of sea-level rise (Farinotti et al., 2019). Since these smaller ice bodies respond fastest to changes in the climate, glaciers (outside of the ice sheets) were the largest contributor to sea-level rise for most of the past century (Frederikse et al., 2020), overtaken by thermosteric contribution after 1970, and are expected to remain a significant contributor in the foreseeable future (Frederikse et al., 2020; Slangen et al., 2017).
Water is accumulated and released by glaciers on various timescales, ranging from long-term storage in ice and firn to short-term storage in snow cover. Within their basins, glaciers act as a buffering system, preventing precipitation from immediately turning into runoff in downstream rivers (Jansson et al., 2003). The seasonality of glacier runoff therefore modulates downstream flow, providing meltwater in otherwise potentially dry seasons or years of low flow (Huss and Hock, 2018; Ultee et al., 2022; Förster and van der Laan, 2022). Due to this buffering capacity, they are essential parts of global water towers, defined as mountain range water storage and supply to downstream communities and ecosystems, upon which 22 % of the global population is dependent for their water needs (Immerzeel et al., 2020).
With changes in climate, glacier mass balance – a temporal integration of both accumulation and melt, largely governed by temperature and precipitation – is altered, impacting over time the glacier mass and thus storage capacity. Despite their critical relevance, decadal timescales are rarely considered in glacier modeling studies. This omission is significant, given that such timescales are critical for water resource management, anticipating glacier-change-induced impacts on catchment hydrology (Frans et al., 2016; Lane and Nienow, 2019). The need for annual to decadal predictions is well recognized, despite the early stage of the field (Boer et al., 2016; Merryfield et al., 2020). When using the term decadal in this study, it encompasses timescales of 1 to 10 years. The term “decadal prediction”, as used here, encompasses predictions on annual, multi-annual and decadal timescales (Boer et al., 2016).
In 2016, the World Climate Research Programme (WCRP), co-sponsored by the World Meteorological Organization (WMO), the Intergovernmental Oceanographic Commission (IOC) of UNESCO, and the International Science Council (ISC), set up the Grand Challenge on Near Term Climate Prediction, to make the case for, and understand the challenges in establishing, routine operational climate predictions on these timescales (Kushnir et al., 2019). As of now, there are multiple ensembles of model hindcasts/re-forecasts available. These terms are used interchangeably in the literature. In this study, for consistency, we will use the term re-forecast, defined here as a retrospective prediction (Boer et al., 2016), realized with the aim to evaluate these retrospective predictions against observations and provide insight into our capacity to provide real decadal forecasts.
With the advent of operational predictions (Hermanson et al., 2022), there is growing research activity into the application of decadal forecasts (Dunstone et al., 2022). Glacier modeling is one such field, where there is a gap between seasonal modeling of glacier mass balance and runoff (e.g., Koziol and Arnold, 2018; Réveillet et al., 2018) and the more established modeling on the century and millennial scale, often using downscaled general circulation model (GCM) output (e.g., Huston et al., 2021; Rounce et al., 2023). By quantifying and improving the predictability of glacier mass balance on the decadal scale, it may be possible to bridge this gap. If so, the resulting mass balance predictions could also be translated into glacier runoff, serving as an important input for water resource decisions, which often operate on this timescale (Kiem and Verdon-Kidd, 2011).
The aim of this study is to investigate the utility of forcing a mass balance model with decadal-scale re-forecasts, to complement current mass balance modeling studies and the timescales they are commonly conducted on – centuries and millennia. As far as we know, this study is the first of its kind in large-scale (regional to global) glacier modeling, following the testing of the applicability of decadal re-forecasts in impact models for other research disciplines, such as marine biology (Payne et al., 2022) and the agricultural sector (Solaraju-Murali et al., 2022). A compact review of applications, including a preliminary version of the current study, is presented in O'Kane et al. (2023). The current paper presents a modeling study using the Open Global Glacier Model (OGGM; Maussion et al., 2019), structured into two main components, on different spatial scales. The first component focuses on a set of 279 reference glaciers, while the second examines all global land-terminating glaciers. The OGGM simulations are driven by a multi-model, multi-member ensemble of monthly temperature and precipitation re-forecasts from the Coupled Model Intercomparison Project Phase 6 (CMIP6) Decadal Climate Prediction Project (DCPP; Boer et al., 2016). To evaluate the added skill of decadal re-forecast forcing, we compare these results with simulations using two alternative experiments: forcing with a simple persistence method and with uninitialized, free-running historical GCM outputs and projections from the same models. This latter approach represents the traditional forcing method typically used for 21st-century glacier simulations.
2.1 Model
We use OGGM v.1.5.3 (Maussion et al., 2019, 2022) for the first component of the study – the simulation of reference glaciers – and a slightly updated version, in terms of calibration, which was not an official release, for the global runs (see Sect. 2.2.2). OGGM is an open-source modeling framework written in Python. It was developed to provide a catchment- to global-scale, modular numerical modeling framework for various study setups of glacier evolution on multiple scales while accounting for glacier geometry and ice dynamics. Maussion et al. (2019) and the continuously expanding and adapting model documentation at http://docs.oggm.org (last access: 12 March 2025) explain the model in detail and can be referred to for an in-depth model description. The current study focuses on the application of OGGM's mass balance modeling capabilities rather than the glacier flow.
2.1.1 Methodology
The model takes a glacier-centric approach, using the outline of a glacier as a starting point. By default, the glacier outlines are automatically taken from the Randolph Glacier Inventory (RGI) version 6.0 (Pfeffer et al., 2014; RGI Consortium, 2017). Given a glacier outline and topographical and climate data, OGGM aims to “(i) provide a local map of the glacier including topography and hypsometry, (ii) estimate the glacier's total ice volume and compute a map of the bedrock topography, (iii) compute the surface climatic mass balance and (if applicable) its frontal ablation, (iv) simulate the glacier's dynamical evolution under various climate forcings and (v) provide an estimate of the uncertainties associated with the modeling chain” (Maussion et al., 2019; Recinos et al., 2019). In this study, we solely use the pre-processing and mass balance modeling capabilities of the model, not the dynamical modeling tools. We call this a “fixed-geometry approach”; i.e., the surface area and elevation of the glacier are fixed when computing glacier-wide mass balance. This simplification assumes that geometry feedbacks are negligible at annual-to-decadal scales and is justified by the dominance of other uncertainties, such as the unknown glacier state in the past (e.g., Eis et al., 2021). Furthermore, we only evaluate the climatic mass balance component and do not evaluate calving or other mass loss processes: from now on, we will use the term “mass balance” in place of “climatic mass balance” for simplicity.
2.1.2 Mass balance
The mass balance module selected for the model setup is the OGGM default in v.1.5.3. Mass balance is calculated using a temperature index model which generates monthly accumulation and ablation along the glacier:
in which mi is monthly mass balance at elevation z and Ti constitutes the monthly temperature that is adjusted based on its elevation using a temperature lapse rate of 6.5 K km−1. The threshold temperature Ti(z)−Tmelt, above which melting occurs, is set to the default of −1 °C. Here, the precipitation correction factor pf is set to 2.5 globally (Maussion et al., 2019). is the monthly solid precipitation, computed as a fraction of total precipitation, based on the monthly mean temperature. The temperature sensitivity of a glacier is indicated by calibrated parameter μ*, and ϵ is an optional residual, determined during calibration. In this study we make use of two independent calibration procedures, one making use of observations of glaciers with a mass balance record of at least 5 consecutive years (Sect. 2.2.1) and the other based on a global dataset (Sect. 2.2.2).
2.1.3 Study structure and simulations
The study is divided into two components, each with different spatial scales; see Table 1. For the first component, we focus on 279 glaciers with direct mass balance observations from the World Glacier Monitoring Service (WGMS, N=279 glaciers; WGMS, 2022). These glaciers, referred to as reference glaciers, represent land-terminating glaciers with robust observational records, spanning at least 5 consecutive years per glacier. Over the period 2000–2020, 2676 separate annual mass balance measurements are available for the 279 glaciers. For the second component, the mass balance of the approximately 214 000 land-terminating glaciers globally is simulated. In both components, the geometry of the glacier is based on the state at the RGI inventory date, usually between the years 2000 and 2010, and remains unchanged throughout the simulation. Validation with observed data is done for the period 2000–2020.
The study aim is to analyze forcing with decadal re-forecasts. However, in order to put the results into context, we perform a total of three experiments, with different complexity, for each component. The experiments represent forecasting from very simple to complex methods.
These experiments are defined as follows:
-
Decadal re-forecast: OGGM is forced with a 21-member multi-model ensemble of CMIP6 DCPP-A decadal re-forecasts (see Sect. 2.4 and Table 1). All different realizations (ensemble members) are downscaled to the glacier scale and run for all available decades. Final output: a multi-model ensemble mean of results, from averaging results of the simulations with 21 members. This yields a time series with one mass balance value per year and glacier, 2000–2020.
-
Persistence: OGGM is forced with a simple, persistence-type forecast, where each period (lead times 1 to 9 years; see Sect. 2.3) is the same as the one that precedes it (from here on referred to as “persistence forecast”). For this, we use the baseline climate of the Climatic Research Unit (CRU) data. For example, the mass balance results from CRU forcing 1990–2000 form the persistence forecast for the period 2000–2010. Persistence forecasts are a typical null hypothesis against which other forecast skill is measured (Hargreaves, 2010). Final output: a time series with one mass balance value per year and glacier, 2000–2020, based on the mean simulated mass balance under the baseline climate for the respective preceding decade.
-
GCM historical: OGGM is forced with a 21-member ensemble of CMIP6 historical simulations for 2000–2014, using climate projections for 2014–2020; see Sect. 2.4. Historical GCM runs and GCM projections are the current state of the art in forcing glacier models for the 20th and 21st century, including on the near-term timescale (see Hock et al., 2019; Slangen et al., 2017; and Zekollari et al., 2022, for overview papers). Final output: a multi-model ensemble mean of results, from averaging results of the simulations with 21 members. This yields a time series with one mass balance value per year and glacier, 2000–2020.
The purpose of this combination of experiments is to assess the added value of the decadal re-forecasts over a naïve forecast method (persistence) and with the current state of the art (GCM historical), as well as analyzing forecast skill of decadal and persistence (re-)forecasts at different lead times.
2.2 Mass balance model calibration
For each component of our study, we carry out a separate model calibration. It must be noted that for the purpose of our study, the performance of the mass balance model itself is secondary, since only the change in performance when using various forcing products is investigated. The calibrated parameters are held constant for each forcing product, allowing us to assess the impact of the forcing strategy alone, not the impact of calibration.
2.2.1 Component 1 – 279 reference glaciers – calibration with WGMS data
For the first component, the mass balance calibration procedure is carried out with baseline climate CRU – see Sect. 2.4 – over the years with observed data that fall within the CRU climate time series (1901–2020). We use the default calibration procedure as of OGGM v.1.5.3, described in Marzeion et al. (2012) and Maussion et al. (2019). For all years with observations, the model output is then compared to observations to identify the best candidate for μ* and ϵ on a glacier-by-glacier basis. Because all of these 279 glaciers have observations, the parameters do not need to be transferred to glaciers without observations and the mass balance model is calibrated to match observations over the calibration period.
2.2.2 Component 2 – global glaciers – calibration with geodetic data
For the second component of this study, OGGM is calibrated separately, also using baseline climate CRU (Sect. 2.4). We benefit from the dataset by Hugonnet et al. (2021), providing geodetic mass balance estimates for 94 % of all global glaciers over the period 2000–2020. This dataset facilitates a broader calibration for the global runs, incorporating glaciers beyond the WGMS reference set. The monthly mass balance is computed as in Eq. (1) but without making use of the residual ϵ, since this dataset consists of data for each glacier, removing the need for parameter transfer to glaciers without observations. More detailed information on the use of residual ϵ can be found in Marzeion et al. (2012). The calibration prioritizes temperature sensitivity parameter μ*, calibrated to match the glacier's geodetic mass balance of the period 2000–2020 (Hugonnet et al., 2021). Note that the re-calibration for the global run is a practical necessity (we are simulating all glaciers globally) but has no bearing on our results, since we compare OGGM results with different forcing strategies; i.e., we are assessing not the model or its parameters but the forcing data used for the simulations. It must be added that in our study, we will always run the model during the period it has been calibrated for. This means that when run with the baseline climate CRU, it provides “perfect results”: exactly matching observations over the calibration period (bias of zero).
2.3 Lead time and ensembles
Due to the importance of the initial state of the climate system (atmosphere and ocean) for decadal prediction, forecast skill often declines with lead time (Zhu et al., 2019). Lead time here adheres to the definition by the American Meteorological Society: “The length of time between the issuance of a forecast and the occurrence of the phenomena that were predicted”. To assess the lead-time-based skill in the context of our study, we create lead-time-based ensemble means of results in the decadal re-forecast experiment to validate against observations. This results in nine time series of mass balances, 2000–2020, with input from lead times 1–9, respectively. Due to the clipping to hydrological years to match the WGMS measurements, lead time 0 does not exist in component 1. So for a decadal re-forecast initialized in 1990 (always in November), the first full year of simulated values is for the year 1992. In component 2, the first full year of simulated values would be 1991. For the persistence experiment, we also create lead-time-based time series. Here, lead time refers to the forecast length, which is the same time period until the start of the forecasts. An example of lead time 2 persistence forecast would be the forecast period 2000–2001 being the same as the year 1998–1999. In the case of our study period 2000–2020, the lead time 1 persistence forecast uses temperature and precipitation from the time period 1999–2019, and lead time 9 therefore uses values from 1991–2011 for the forecast. In our decadal re-forecast experiment, we create ensemble means of results as they would be utilized in practice. As Risbey et al. (2021) note, many assessments of re-forecast skill are likely overestimated, as the re-forecasts are informed by observations over the period assessed that would not be available to real forecasts. In order to avoid this as much as possible, we only assess a period that was not used in the drift correction (see Sect. 2.5) and use only lead times that would be available at the beginning of the forecast period. This results in time series for each glacier and each full decade in the period 2000–2020. For an example decade, say 2000–2010, the ensemble mean of results consists of information from all re-forecasts initialized in 1990–2000. This means the ensemble size decreases over time, with only lead time 9 information being available in 2009. Decreasing ensemble size at longer lead times typically increases forecast uncertainty, potentially affecting skill metrics slightly. We use information from all lead times available to maximize ensemble size and, in turn, skill (Kadow et al., 2017). This approach again reiterates the fact that decadal re-forecasts are multi-annual, rather than strictly for 10 years. In order to compare persistence as it would be used in practice, for decadal mean and cumulative mass balance forecasting, persistence forecasts at lead time 9 are applied.
2.4 Climate data
The mass balance model in OGGM requires monthly climate in the form of reference height (2 m) air temperature and precipitation time series. The default baseline climate forcing, which we use for our persistence experiment, is the gridded Climatic Research Unit Time Series (CRU TS v4.01; Harris et al., 2020). This coarse (0.5°) dataset is then interpolated to a higher-resolution climatology (CRU CL v2.0 at 10′ resolution; New et al., 2002) following the anomaly mapping approach described in Harris et al. (2020) to acquire climate time series with elevation data, which is not an attribute in the CRU TS. For each glacier, the monthly time series of temperature and precipitation are taken from the grid point closest to the glacier. Temperature is converted using an elevation-based lapse rate of 6.5 K km−1, and precipitation is corrected using the default correction factor of pf=2.5 (Maussion et al., 2019). While the CRU dataset constitutes our OGGM baseline climate for calibration, OGGM can also be supplied with GCM output that is bias-corrected to the baseline climate, as we do in the decadal re-forecast and GCM historical experiments.
In the decadal re-forecast experiment, OGGM is driven with a multi-model, multi-member retrospective ensemble of monthly temperature and precipitation re-forecasts from the DCPP component A, which provides re-forecasts. We use decadal realizations from the “Flexible Global Ocean-Atmosphere-Land System Model” (FGOALS; Zhou et al., 2018), the “Norwegian Climate Prediction Model” (NorCPM; Counillon et al., 2016; Bethke et al., 2021) and the “Model for Interdisciplinary Research on Climate version 6” (MIROC6; Tatebe et al., 2019; Kataoka et al., 2020). We use the r1i1p1f1-r1i1p1f10 realization of all models where available. The DCPP-A decadal re-forecasts are initialized each year in the period 1960–2010, the first forecast year being 1961. The processing of the decadal data is explained below, in Sect. 2.5.
In the GCM historical experiment, we drive OGGM with temperature and precipitation from the historical iteration and projections of the same three GCMs, obtained from CMIP6 archived model output: FGOALS, NorCPM and MIROC6 (see Table 1). The end of the historical simulation is in 2014, and data from 2015–2020 are provided by projection runs, leading to 11 full decades in the period 2000–2020. As the number of available data becomes larger because of the different shared socio-economic pathways (SSPs), the choice is to select certain SSPs or to introduce a discrepancy in ensemble size if using all CMIP6 SSPs. We realize that neither option facilitates perfect comparison with the decadal re-forecast and persistence experiment. However, the benefit of comparison with projections outweighs these concerns, as initialized forecast vs. projections represents the most realistic future use case. To preserve ensemble size, SSP2-4.5 was chosen as the projection for comparison, as it represents a medium pathway of future greenhouse gas emissions. From FGOALS runs, not enough SSP2-4.5 realizations were available at the time of the study, so SSP1-2.6, SSP2-4.5 and SSP5-8.5 time series were used. As these scenarios do not diverge significantly during 2015–2020, we still consider these results comparable.
In the pre-processing for this experiment, the time series are bias-corrected and downscaled to the glacier scale using a variation of the delta method (Ramírez Villegas and Jarvis, 2010). Here, we take GCM anomalies relative to the 1961–1990 GCM mean for temperature and apply these to the CRU TS 4.01 (Harris et al., 2020) 1961–1990 means. The correction is applied monthly and ensures that the mean and standard deviation are preserved during the bias correction period for temperature. Precipitation is corrected with a multiplicative factor and preserves only the mean. This is the standard method of processing GCM data for projection studies (e.g., Zekollari et al., 2020) and is the reason we use it as an evaluation procedure for the decadal re-forecasts.
2.5 Re-forecast drift correction and downscaling
In order to drive OGGM with the re-forecasts, each member is downscaled to the glacier scale using a statistical method applied with baseline climate CRU (New et al., 2002; Harris et al., 2020). Decadal re-forecasts experience a bias referred to as drift because they start from an initialized state constrained by observations, which is inconsistent with the model's dynamics (Kharin et al., 2012; Manzanas, 2020). The drift is lead time dependent because, with time progressing, the model drifts away more from the initial state towards a state more consistent with the model's climatology, which can lead to significant error (Pasternack et al., 2021). The re-forecasts have to be bias-corrected to counter this error. Our correction adheres to recommendations in Boer et al. (2016), who recommend an overarching bias correction method, regardless of the initialization type of the forecast. The reasons behind these recommendations are discussed in depth in, for example, Boer et al. (2016), Kharin et al. (2012) and Hossain et al. (2022). We assume that the bias contained in each member is model dependent and lead time dependent. Because of the assumption that the bias is different at and dependent on each lead time, subtracting a mean drift per member would lead to over-compensation at some lead times and residual drift at others. For this reason, we create lead-time-based climatologies per model, meaning one climatology over 1971–2000 which contains all lead times 1 year from the re-forecasts, one which contains all lead times 2 years from the re-forecasts and so on. These are then used to create anomalies relative to the baseline climate. For each model, each member is bias-corrected according to
in which dt is the average model bias or drift at each lead time t, calculated relative to the baseline climate input CRU. CRUcl is the CRU monthly climatology averaged over 1971–2000 (Harris et al., 2020; New et al., 2002). is the model climatology at lead time t calculated by averaging the lead time t re-forecasts of all ensemble members of one model falling into the period 1971–2000. is the raw re-forecast, of member m at lead time t for year y. is the bias-corrected re-forecast of member m at lead time t for year y.
Lead-time-dependent bias correction is a fundamental step used in decadal forecasting, as is mean bias correction in future projections for impact modeling. Because the aim of our study is to compare the standard methodologies commonly applied in the respective fields, the GCM historical data are processed as they would typically be in glacier modeling (e.g., Zekollari et al., 2024; Rounce et al., 2023; and many more), and the re-forecasts are processed according to the recommendations mentioned above. It must be added that the drift correction, following the lead time (the length of time between the issuance of a forecast and the occurrence of the phenomena that were predicted), does not apply to the GCM historical experiment, where we have only one simulation per realization.
This section evaluates the results of the three experiments – decadal re-forecast, persistence and GCM historical – by comparing them to observed glacier mass balance data for individual years and decadal mean and cumulative mass balance. The mean and cumulative decadal mass balances are distinct from each other as the cumulative mass balances only include the simulated years for which observations exist. Throughout our analysis, we compare observational time series which inherently contain inter-annual variability to ensemble means from multi-member forecasts. It is important to recognize that ensemble means naturally reduce variability by averaging across multiple realizations, thereby smoothing internal fluctuations that occur in any single realization or observed series. This reduction in variability means that some of the absolute errors (e.g., the mean absolute error) used in the evaluation arise from the difference in variability between a single observational realization and the ensemble mean. Despite this limitation, ensemble means represent the most commonly used forecast product in practical applications due to their stability and reliability.
3.1 Component 1: WGMS glaciers
We first focus on individual years, assessing how skill changes with lead time. To quantify model skill, we calculate the mean absolute error (MAE) and the Pearson correlation coefficient (r) for each lead-time-based ensemble mean, with the results displayed in Fig. 1. The decadal re-forecast experiment shows consistent performance, with an MAE of 0.64 m w.e. at lead time 1 and a slightly lower value of 0.63 m w.e. at lead time 4, before increasing slightly to 0.65 m w.e. by lead time 9. Pearson correlation coefficients decrease from 0.14 at lead time 1 to 0.07 at lead time 9, reflecting the inherent difficulty of simulating individual annual mass balance values. For the persistence experiment, MAE increases from 0.69 m w.e. at lead time 1 to 0.72 m w.e. at lead time 9, with greater variability in correlation compared to the decadal re-forecast forcing. By contrast, results from the GCM historical experiment, which does not depend on lead time, show constant skill. As there is no application of lead time to the GCM historical experiment, its results (MAE = 0.66 m w.e., r=0.01) are plotted as a constant to compare the skill score magnitude. Overall, skill is low when simulating individual years, which is expected and in line with other studies using OGGM for this set of reference glaciers. One example is Eis et al. (2021), who yield an MAE of 0.60 m w.e. with baseline climate CRU over the time period from 1917 until the RGI outline date per glacier (often early 2000s). Simulating single-year mass balances is not the focus of this study; however, the results show that already on the single-year level, forcing OGGM with decadal re-forecasts outperforms the persistence and the GCM historical experiments, which both show higher errors and lower correlations.

Figure 1Forecast skill for annual mean mass balance for WGMS reference glaciers (N=279). Forecast skill is given as the Pearson correlation coefficient (r) in (a) and mean absolute error (MAE, m w.e.) between observed and simulated mass balance observations (N=2676) in (b). Skill is plotted as a function of lead time into the future, calculated across the appropriate comparison periods (2000–2020) for the decadal re-forecasts and persistence forecasts. The mean skill scores of the GCM historical (2000–2020) simulations are also shown for reference, as a constant, since there is no application of lead time. Shaded areas for both persistence and decadal re-forecast (RF) denote the 90 % confidence interval estimated by bootstrapping: 5 % of the distribution is above and 5 % below the shaded areas.
Next, we compare mean and cumulative mass balance over full decades, as illustrated in Fig. 2 and Table 2. To quantify model skill, we look at the mean error (ME), which is the average difference between observed and simulated values (sometimes called “mean bias”); the mean absolute error (MAE); and the Pearson correlation coefficient. In the time period 2000–2020, we have 11 full decades for which the mean and cumulative mass balance are calculated per glacier. The full decades, which have significant overlap, are still compared separately and depicted in Fig. 2 to show how the choice of decade can considerably impact skill statistics. For example, in the decadal re-forecast experiment, the decade 2001–2011 has a mean model error of −0.022 m w.e., whereas the decade 2002–2012 has a mean model error of 0.11 m w.e.
Observed mean mass balance for the decade 2000–2010 is −0.79 m w.e. Simulations using reanalysis data (CRU), which provide a benchmark for comparison, yield a mean error (ME) of −0.037 m w.e. and an MAE of 0.23 m w.e. Decadal re-forecasts produce comparable results, with an ME of 0.091 m w.e. and an MAE of 0.27 m w.e., while persistence forecasts display a larger ME of −0.16 m w.e. and an MAE of 0.39 m w.e. The GCM historical experiment shows slightly better performance than persistence, with an ME of −0.0059 m w.e. and an MAE of 0.29 m w.e. Correlation coefficients are moderate to high, with decadal re-forecasts achieving the highest correlation (r=0.64), followed by GCM historical (r=0.61) and persistence (r=0.58).
In terms of cumulative decadal mass balance, error patterns are similar. Decadal re-forecasts achieve an ME of −0.39 m w.e. and an MAE of 1.33 m w.e., while persistence experiments have an ME of −0.96 m w.e. and an MAE of 1.62 m w.e., reflecting the difficulty of using persistence forecasts for warming-sensitive variables like glacier mass balance. GCM historical simulations show comparable results to decadal re-forecasts, with an ME of −0.27 m w.e. and an MAE of 1.58 m w.e.
To gauge the statistical significance of the differences between experiments, we carry out a two-tailed t test (significance level 0.05) on the set of individual glaciers. The samples for these tests are the 279 glacier-specific values of decadal mean and cumulative mass balance for each forcing type. For decadal re-forecasts and GCM historical experiments, these values are ensemble means, whereas for the persistence forecast, a single-member simulation is used. For the decadal mean mass balance, the difference between the decadal re-forecast and persistence experiment is statistically significant, as is the difference between the persistence and GCM historical experiment. However, there is no significant difference between the decadal re-forecast and GCM historical experiments. For the cumulative mean mass balance, there are no statistically significant differences amongst the experiments. We also perform a binomial test, assessing improved skill by specifically evaluating reductions in mean absolute error for the decadal re-forecast relative to persistence or historical forcing. This shows that out of the 279 glaciers we analyze, 174 showed improved skill using decadal re-forecasts for decadal mean mass balance and 186 showed improved skill for cumulative mass balance. Using a binomial significance, this suggests that the overall improvement from using decadal re-forecasts is significant at the 5 % level. We note that the t test and binomial test can yield different interpretations because they test different aspects of statistical significance. The t test compares the mean differences between forecast methods across the entire set of glaciers, assessing whether differences in the average performance are statistically distinguishable given the spread in the dataset. By contrast, the binomial test assesses significance based only on the count of glaciers showing improved performance (in terms of lower mean absolute error) when comparing one forecast method directly with another, irrespective of the magnitude of improvement.
Thus, while in some cases, the t test does not indicate a statistically significant difference, due to variability across glaciers and the relatively small magnitude of improvement on average, the binomial test can indicate statistical significance simply because more glaciers improve under one approach compared to another than would be expected by chance alone. Both tests provide complementary perspectives: the t test assesses the robustness of the average improvement magnitude across glaciers, whereas the binomial test evaluates consistency in improvement across individual glaciers.
The persistence experiment shows a notable overestimation of mass balance, particularly at longer lead times. This is evident in Fig. 2, where persistence forecasts systematically deviate from the observed values due to a lag in warming trends. This highlights the limitations of persistence-based methods for forecasting glacier mass balance on decadal timescales, where warming trends have significant influence, as is the case with glacier mass balance.
The skill differences between the experiments are further quantified in Table 2. Here for the reference glaciers, forcing with decadal re-forecasts outperforms forcing with persistence forecasts or historical GCM data, even though the absolute improvements in skill are small and statistically not significant between the decadal re-forecast and GCM historical experiments, for individual glaciers. Pearson correlation coefficients between simulated and observed values are generally in the same range for all three experiments, with the highest correlations for the decadal re-forecast experiment. All Pearson correlations are moderate – up to 0.64 for the decadal re-forecast experiment – for mean decadal mass balance and high – up to 0.85 – for cumulative decadal mass balance. Comparison these correlations to the low degree of correlation when simulating individual years emphasizes how the skill of our experiments lies primarily in simulating multi-annual averaged or cumulative mass balances, filtering out inter-annual noise. This is in line with similar approaches, where skill is found through the integration of fluxes over time, such as in seasonal snow accumulation (Förster et al., 2018).

Figure 2Simulated and observed mean mass balances over each full decade (N=11) over all 279 reference glaciers. Only simulated values where observed values are available are used to generate the decadal means.
Table 2Summary of the comparison between WGMS-observed and simulated mass balances for all three experiments. The statistics shown are ME (model error), MAE (mean absolute error) and Pearson correlation, as well as half the interquartile range of the particular statistic.

We also note here that in the decadal re-forecast experiment, the multi-model ensemble mean of results yields higher skill than single-model ensembles. Comparing the multi-model ensemble mean of results to observations of mean decadal mass balance vs. single-model ensemble means of results (N=3) compared to observations yields a decrease in MAE of 11 % (vs. FGOALS), 8 % (vs. NorCPM) and 6 % (vs. MIROC6), respectively. This aligns with our expectations from the literature due to an increase in ensemble size and associated error cancellation and the separate forecast systems adding signal to the multi-model ensemble (Kadow et al., 2017; Delgado-Torres et al., 2022). Other studies applying multi-model ensembles in impact models, such as Payne et al. (2022), also come to the conclusion that a multi-model ensemble generally gives the best performance, which is why we do not explore single-model ensemble performance further.
3.2 Component 2: global glaciers
For all global land-terminating glaciers, we compare results of mean decadal mass balance in all experiments. To assess skill, 2000–2010 and 2010–2020 results are validated against the global geodetic mass balance dataset by Hugonnet et al. (2021). Because the time period 2000–2020 is covered by both validation datasets, a separate comparison of the two is provided in Sect. 3.3.
Comparing 2000–2010 decadal re-forecast skill scores to the persistence and GCM historical experiments, improvement in skill is slight (Table 3). Over the decade 2000–2010, decadal re-forecasts yield a 23 % and 18 % reduction in mean absolute error (MAE) relative to persistence and GCM historical forcing, respectively. Over this period, the MAE for the decadal re-forecast experiment was 0.28 m w.e., compared to 0.35 m w.e. for persistence forecasts and 0.33 m w.e. for the GCM historical experiment. The Pearson correlation coefficients similarly favor re-forecasts, potentially indicating higher overall skill on a global scale. Performing a two-tailed t test (significance level 0.05) on the collection of glaciers shows that while the difference between the persistence experiment and the two other experiments is significant, the difference between the decadal re-forecast and GCM historical experiments is not.
Between the different glaciated regions of the world – RGI regions (see Fig. 3) – there is considerable variation in skill. Indicated in the histograms in Fig. 4 are the regional mean absolute errors for 2000–2010, which vary considerably. The decadal re-forecast experiment achieves good agreement (regional mean difference ≤ 0.1 m w.e.) in 10 of 18 regions, reasonable agreement (difference 0.1–0.3 m w.e.) in 7 regions and mediocre agreement (difference ≥ 0.3 m w.e.) in 1 region. Variability in MAE is larger for the persistence and GCM historical experiments, with notable overestimation of mass balance due to lagging warming trends in persistence forecasts. The persistence experiment generally performs worse for regions sensitive to rapid warming, such as Greenland and Iceland, highlighting the limitations of static forecasts in a warming climate. Overall, the differences between experiments all lie within 1 standard deviation of the mean, for the simulations as well as within the mean error of observations (Hugonnet et al., 2021). All 2000–2010 observed values and their uncertainty, as well as all simulated values per experiment, are included in Table 3.

Figure 3Map of all RGI regions, of which we use 18 in OGGM as we exclude Antarctica (region 19). Numbered light-blue polygons correspond to the glacier regions (GTN-G, 2023) listed in Tables 4 and 5. Dark-red areas correspond to all glaciers listed in the RGI inventory (RGI Consortium, 2017). The country outlines are made with Natural Earth country polygons (https://www.naturalearthdata.com, last access: 2 January 2024).
The 2010–2020 results are slightly different than for the 2000–2010 period in that the persistence experiment outperforms the decadal re-forecast experiment in terms of the overall skill score. The mean absolute error for the decadal re-forecast experiment is 0.28, while persistence mean absolute error is 0.24, with Pearson correlation coefficients of 0.69 and 0.77, respectively. In terms of the goodness of fit per region however, the decadal re-forecast experiment slightly outperforms the persistence experiment, with 11 of 18 regions showing a good fit (defined as a difference between regional means ≤ 0.1 m w.e.). Of the 18 regions, 5 show a reasonable fit (difference between regional means 0.1–0.3 m w.e.) and 2 show a mediocre fit (difference between regional means ≥ 0.3 m w.e.). The persistence experiment shows 8 regions with good fit, 8 regions with reasonable fit and two regions with mediocre fit. The exact goodness-of-fit numbers, including the observations and observational uncertainty, can be found in Table 5.
The fact that the persistence experiment performs markedly better for this decade than for the previous one, while skill scores and goodness of fit are similar for the decadal re-forecast experiment, lies in the calibration. As explained in Sect. 2.2.2, the calibration and validation periods are the same. Because of the nature of persistence forecasts, the forcing data for the 2000–2010 period originated from 1990–2000 and were not used in calibration. For the period 2010–2020, however, both the climate data for the persistence forecast (2000–2010) and the forecasted decade were part of the calibration, resulting in a bias of 0 for the full time period of 2000–2020. To assess the effect of this, we run another persistence simulation, this time with a model only calibrated for the period 2000–2010, leaving 2010–2020 for validation. This is also the scenario that would occur when using persistence to forecast 2020–2030: the model is calibrated for the decade prior to the forecasted one. This leads to a markedly worse score for the persistence forecast, with a mean absolute error of 0.38 ± 0.36 m w.e. and a Pearson correlation coefficient of 0.26, indicating low correlation. For the sake of consistency, keeping calibration the same for all experiments, the persistence experiment results presented in Table 5 are from the simulation with the original calibration.
Table 3Observed and simulated global mass balance skill scores. Summary of the comparison between global observed and simulated mean mass balances for all three experiments in the decades 2000–2010 and 2010–2020. The statistics shown are ME (model error), MAE (mean absolute error) and Pearson correlation, as well as half the interquartile range for the particular statistic.


Figure 4Mean absolute error for runs with decadal re-forecast forcing (a), persistence forcing (b) and GCM historical forcing (c), for all RGI regions (N=18). The vertical lines indicate the mean (bold, black), median (dotted, black), and 5th and 95th percentiles (grey) of the mean absolute error.
Table 4Overview of mass balances over the period 2000–2010, from the decadal re-forecast, persistence and GCM historical experiments, as well as observed data from Hugonnet et al. (2021). The Hugonnet et al. (2021) values include their uncertainties. Values are formatted according to the goodness of fit between experiment and Hugonnet et al. (2021) data: cells in bold indicate a good fit (difference between regional means ≤ 0.1 m w.e.), cells in a normal font indicate a reasonable fit (difference between regional mean 0.1–0.3 m w.e.) and cells in italics indicate a mediocre fit (difference between regional mean ≥ 0.3 m w.e.).

Table 5Color-coded overview of mass balances over the period 2010–2020, from the decadal re-forecast, persistence and GCM historical/projection experiments, as well as observed data from Hugonnet et al. (2021). The Hugonnet et al. (2021) values include their uncertainties. The table is color coded according to goodness of fit between experiment and Hugonnet et al. (2021) data: cells in bold indicate a good fit (difference between regional means ≤ 0.1 m w.e.), cells in regular font a reasonable fit (difference between regional mean 0.1–0.3 m w.e.) and cells in italics indicate mediocre fit (difference between regional mean ≥ 0.3 m w.e.).

3.3 Observed data differences
Finally, this study clearly benefits from the availability of two separate datasets of observed mass balance for the same 2000–2020 time period. This allows for more critical assessment not only of model results, but also of uncertainty within observations. The use of both datasets here warrants a comparison between overlapping observations. For the period 2000–2010, we calculate the mean mass balance for all glaciers where the WGMS dataset has full observations throughout the decade (N=90). For this subset of 90 glaciers, we find a mean bias/difference of −0.049 m w.e. and an absolute bias/difference of 0.23 m w.e. between the WGMS and geodetic mass balance data. For the period 2010–2020 we have 100 glaciers with uninterrupted mass balance coverage from the WGMS dataset. Comparing to the Hugonnet et al. (2021) mean mass balances yields a mean bias/difference of −0.15 m w.e. and an absolute bias/difference of 0.32 m w.e. For the full time period of 2000–2020, a subset of 67 glaciers has observations throughout the full time period. The mean bias/difference here is −0.11 m w.e., and the absolute bias/difference is 0.23 m w.e. These results are of the same magnitude of error as that which we observe when comparing our decadal re-forecast simulation results to observations, as well as the uncertainties associated with the Hugonnet et al. (2021) data (see Tables 4 and 5). This not only reinforces the need for caution when interpreting observations but also confirms the satisfactory quality of the simulated results.
3.4 Skill
On the whole, the skill displayed in the decadal re-forecast experiment is comparable to or slightly better than in the other experiments. Regional differences in skill (“goodness of fit” in the global component) in the re-forecast experiment likely stem from differences in re-forecast quality. This means the degree of correspondence between observed and simulated temperature and precipitation. We refer to Delgado-Torres et al. (2022) for a comprehensive analysis of the quality of the re-forecasts used here. Their results show generally high skill for DCPP forecasts of temperature, especially over land masses. For precipitation, however, skill is limited in several regions, including central Europe (region 11) and Western Canada/US (region 2), which also show the least mass balance skill out of all regions (see Tables 4 and 5). Good precipitation skill is observed for northern Europe and Central Asia, in line with our yielded “good fit” results for Svalbard and Jan Mayen (region 7) and Central Asia (region 13) (Tables 4 and 5). Delgado-Torres et al. (2022) find the quality of decadal re-forecasts of temperature to be higher than that of historical temperature simulations for multiple regions, while added value is smaller when simulating temperature. With accurate representation of precipitation being essential for mass balance modeling, it is likely that precipitation forecasts are a limiting factor in seeing significant improvement when simulating near-term mass balance with decadal re-forecasts as forcing, over persistence or historical/projection data. Additionally, addressing precipitation in a more complex calibration and downscaling approach may decrease uncertainty in this area. This is to be kept in mind when designing future studies, especially if these include regions where predictive skill for precipitation is low.
The primary source of decadal (re-)forecast skill, beyond free-running simulations, is initialization. The main benefit of initialized decadal forecasts is that the initialization allows them to capture both the response to external forcing and the phase of the internal variability in the climate system. For example, initialized re-forecasts better capture Atlantic multi-decadal variability than GCM historical simulations in a study by García-Serrano et al. (2015). Smith et al. (2019) note that improvements from initialization generally take place in regions where the uninitialized simulations already have some skill. This can also be seen in the similar skill patterns between the GCM historical and decadal re-forecast experiments in Tables 4 and 5. Initialized forecasts better represent internal climate variability due to accurate initial conditions (Smith et al., 2019). This enhances their predictive skill for timescales of 1–10 years compared to uninitialized projections. Initialized decadal forecasts also provide a more constant uncertainty over time than uninitialized projects, whose uncertainty increases significantly with lead time (Strobach and Bel, 2017). Future improvements in initialization, e.g., regarding the initialization of the North Atlantic Oscillation (Nicolì et al., 2025), may therefore still offer potential for reducing uncertainties in near-term glacier modeling, even if the current benefits are limited. In addition, decadal forecasts outperform uninitialized projections in representing regional climate variability, especially in temperature and precipitation, which are crucial for accurately modeling glacier mass balance. As studies such as Thornton et al. (2014) describe, climate variability often exacerbates the impact of climate change on vulnerable communities. In a glacier context, this could mean, for example, above-average melt events impacting downstream communities, so accurate prediction is essential. Finally, Payne et al. (2022) note in their assessment of forcing with decadal re-forecasts vs. uninitialized projections that there is an established demand for communicating both the likely values and the uncertainty in a forecast made with an impact model (Bruno Soares and Dessai, 2016). When the effort is made to minimize this uncertainty and make a forecast that is as precise as possible, forcing with initialized forecasts is likely preferable on the decadal scale. Despite these advantages, decadal forecasts are still in development, and the continuation of projects such as the DCPP contribution to CMIP6 (Boer et al., 2016) is essential to ensure their operational use.
This study also reveals arguments for applying decadal (re-)forecasts over persistence or uninitialized projections in future near-term glacier modeling studies. Especially warming and glacier mass loss accelerating (Hugonnet et al., 2021) support an argument for forcing near-term simulations with initialized forecasts over persistence forecasts, which lack the warming trend over the simulation period. The choice of initialized forecast forcing over uninitialized projections stems mainly from the probabilistic context: projections, especially as pathway differences increase over time, are inherently more uncertain than a forecast initialized at the beginning of the forecast period. Statistically, we observe no significant difference between the decadal re-forecast and GCM historical experiment per individual glacier (Sect. 3.1). Binomial tests however show a general improvement when forcing OGGM with decadal re-forecasts. Also, Smith et al. (2019) note that significance tests, such as those applied here, often underestimate the improvement from initialization because there is a significant overlap of skill in both experiments (e.g., simulating the global warming trend). Smith et al. (2019) state, “This common signal introduces a bias that is not taken into account in standard significance tests and diminishes their power (Siegert et al., 2017).” Therefore, the small improvements in the skill statistics of the decadal re-forecast experiment over the other experiments may indicate a larger benefit of forcing glacier models with decadal re-forecasts than is evident in Student's t test. This indicates that even modest skill increases from initialization may be reliably attributed to improved forecast initialization rather than to random chance.
Our results show that there is merit in using decadal-scale forecasts in glacier modeling, as they show good predictive skill of averaged multi-annual mass balances. We see that, indicated by lower errors and higher correlations, the use of decadal re-forecasts yields results that are comparable to or better than forcing OGGM with a persistence forecast or the current state of the art: GCM historical data of temperature and precipitation. When forcing OGGM with decadal re-forecasts, a binomial test shows improvement for the majority of the WGMS glaciers and, globally, we see good or reasonable agreement between simulated and observed mean mass balances for almost all RGI regions and on a glacier-to-glacier basis. Forcing with both GCM historical/projection simulations and decadal re-forecasts yields skillful predictions of cumulative mass balance over single decades for the WGMS set of reference glaciers, providing an important basis for modeling the amount of mass moving downstream over a decade. Planning future studies with these forcings of course operates on the assumption that real-time decadal forecasts (for decades that lie in the future) and GCM projections would be of similar quality to the re-forecasts and historical runs used in the current study, and this would benefit from future validation. The use of decadal forecasts would not replace GCM projections for 21st-century glacier modeling but can provide added clarity in the near term, especially in terms of uncertainty.
The results shown here are limited by multiple factors, and we especially highlight the need for continuing this research with a larger ensemble, which could increase predictive skill (Smith et al., 2013). We also propose a more detailed look into ensemble spread, explicitly quantifying and incorporating ensemble spread as a measure of uncertainty. Probabilistic evaluation methods, such as the continuous ranked probability score (CRPS; Hersbach, 2000), could better utilize the full predictive distribution provided by ensemble forecasts, thus giving a more comprehensive picture of forecast uncertainty and reliability. Another important step towards applications in hydrology and industry would be the use of decadal forecasts to force OGGM dynamically, as opposed to the static mass balance in the current study. This would mainly serve to ensure a more accurate initial state of the glacier, important for areas where glaciers have already changed significantly since their RGI inventory date, such as in the European Alps.
Finally, the foremost aim when continuing this research is to have the highest possible quality of near-term glacier simulations for the next decade. Accurate knowledge of near-term trends is essential, as these timescales are most relevant to applications in hydrology and industry (Frans et al., 2016; Lane and Nienow, 2019; Arheimer et al., 2024), especially in regions where populations are directly affected in the form of water scarcity or flooding. This work would add to a growing database of field cases utilizing near-term forecasts; see, for example, O'Kane et al. (2023). Our results support the case for using decadal forecasts to achieve this, rather than depending only on the continuation or repetition of recent decadal climatic conditions (as in persistence forecasts). The next applications of the methods laid out in this study would be on basin and global scales, forcing OGGM with a multi-model ensemble of decadal forecasts, into the 2030s. OGGM would be applied to acquire decadal estimates of future mean and cumulative mass balance, volume and area change, and glacier runoff. Results could provide robust, important information on the amounts of glacier mass lost and moving downstream in the form of runoff. With the continuing and accelerating impacts of climate change on glaciers and water resources, we emphasize the need for these near-term predictions in order to best inform and protect the communities dependent on such resources.
The two figures in this appendix have been produced as an example of results for a single glacier. As can be seen, neither the GCM historical nor the decadal re-forecast experiments perform very well when analyzed at the single-glacier level, in this case for the Hintereisferner and Langfjordjoekulen. In the Hintereisferner case, also cumulatively, neither the GCM historical nor the decadal re-forecasts have provided more skill than the very simple persistence experiment. The ensemble spread, in this case, is even larger than for the GCM historical experiment, which does not speak for its benefits of initialization. In the Langfjordjoekulen case, the decadal re-forecast experiment performs better than the GCM historical and persistence experiments. It does not follow the year-to-year variations but captures the mean mass balance over the decade much better than the other two experiments. Overall, simulating single glaciers with OGGM is not the model's fortitude. The model is calibrated using the baseline climate and publicly available data, which either are sparse (WGMS) or only provide a decadal mean (geodetic global dataset; Hugonnet et al., 2021). Thus, the overall benefits of using a different climate forcing, such as in this study, should be determined over a larger set of glaciers.

Figure A1Single-glacier result example for the Hintereisferner, Austria. Mean errors (MEs) for the different experiments, for mean mass balance over the decade, as in Table 2, are indicated in the legend. These give the difference between the mean observed mass balance and the mean simulated mass balance for the different experiments. For the GCM historical and decadal RF experiments, “simulated” refers to the ensemble mean.

Figure A2Single-glacier result example for the Langfjordjoekulen, Norway. Mean errors (MEs) for the different experiments, for mean mass balance over the decade, as in Table 2, are indicated in the legend. These give the difference between the mean observed mass balance and the mean simulated mass balance for the different experiments. For the GCM historical and decadal RF experiments, “simulated” refers to the ensemble mean.
The documentation of the OGGM is available at https://docs.oggm.org/en/v1.5.3/ (last access: 3 May 2024) and https://doi.org/10.5281/zenodo.6408559 (Maussion et al., 2022). The climate model data used in this study are openly available from the Earth System Grid Federation (ESGF) at the following URL: https://aims2.llnl.gov/search/cmip6/ (last access: 25 May 2024). The OGGM baseline climate CRU is available in the OGGM pre-processed directories at https://cluster.klima.uni-bremen.de/~oggm/gdirs/oggm_v1.4/exps/CRU_new/elev_bands/qc0/pcp2.5/match_geod_pergla/ (last access: 1 May 2025) (Maussion et al., 2022).
LNvdL helped conceive the study, co-developed the drift correction, did the analysis and wrote the paper. AV ran the global simulations and improved the writing and general study design. AAS contributed to the study design and development of the drift correction. FM is the main developer of OGGM and contributed to the study design, model setup and improving the text. KF set up the original study concept and contributed to discussions and improving writing.
The contact author has declared that none of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
Larissa Nora van der Laan, employed by the project Global glacier mass balance prediction on seasonal and decadal scale “GLISSADE”, was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation), grant no. 416069075 (FO1269/1). Anouk Vlug was funded by the Austrian Science Fund (FWF) project P30256 and the German Research Foundation (DFG) – MA 6966/5-1. Adam A. Scaife was supported by the Met Office Hadley Centre Climate Programme (HCCP) funded by the UK Department for Science, Innovation and Technology (DSIT). The authors would like to thank Mark Payne for the helpful discussion on decadal forecasting and Lizz Ultee for her encouragement while completing this study.
This research has been supported by the Deutsche Forschungsgemeinschaft (grant nos. 416069075 (FO1269/1) and MA6966/5-1), the Austrian Science Fund (grant no. P30256) and the Met Office (Met Office Hadley Centre Climate Programme (HCCP) funded by the UK Department for Science, Innovation and Technology (DSIT)).
The publication of this article was funded by the open-access fund of Leibniz Universität Hannover.
This paper was edited by Ian Delaney and reviewed by Kristoffer Aalstad and two anonymous referees.
Allison, E. A.: The Spiritual Significance of Glaciers in an Age of Climate Change, WIREs Clim. Change, 6, 493–508, https://doi.org/10.1002/wcc.354, 2015. a
Arheimer, B., Cudennec, C., Castellarin, A., et al.: The IAHS Science for Solutions decade, with Hydrology Engaging Local People IN one Global world (HELPING), Hydrolog. Sci. J., 69, 1417–1435, https://doi.org/10.1080/02626667.2024.2355202, 2024. a
Bethke, I., Wang, Y., Counillon, F., Keenlyside, N., Kimmritz, M., Fransner, F., Samuelsen, A., Langehaug, H., Svendsen, L., Chiu, P.-G., Passos, L., Bentsen, M., Guo, C., Gupta, A., Tjiputra, J., Kirkevåg, A., Olivié, D., Seland, Ø., Solsvik Vågane, J., Fan, Y., and Eldevik, T.: NorCPM1 and its contribution to CMIP6 DCPP, Geosci. Model Dev., 14, 7073–7116, https://doi.org/10.5194/gmd-14-7073-2021, 2021. a
Boer, G. J., Smith, D. M., Cassou, C., Doblas-Reyes, F., Danabasoglu, G., Kirtman, B., Kushnir, Y., Kimoto, M., Meehl, G. A., Msadek, R., Mueller, W. A., Taylor, K. E., Zwiers, F., Rixen, M., Ruprich-Robert, Y., and Eade, R.: The Decadal Climate Prediction Project (DCPP) contribution to CMIP6, Geosci. Model Dev., 9, 3751–3777, https://doi.org/10.5194/gmd-9-3751-2016, 2016. a, b, c, d, e, f, g
Bosson, J. B., Huss, M., and Osipova, E.: Disappearing World Heritage Glaciers as a Keystone of Nature Conservation in a Changing Climate, Earth's Future, 7, 469–479, https://doi.org/10.1029/2018EF001139, 2019. a
Bruno Soares, M. and Dessai, S.: Barriers and enablers to the use of seasonal climate forecasts amongst organisations in Europe, Climatic Change, 137, 89–103, https://doi.org/10.1007/s10584-016-1671-8, 2016. a
Counillon, F., Keenlyside, N., Bethke, I., Wang, Y., Billeau, S., Shen, M. L., and Bentsen, M.: Flow-dependent assimilation of sea surface temperature in isopycnal coordinates with the Norwegian Climate Prediction Model, Tellus A, 68, 32437, https://doi.org/10.3402/tellusa.v68.32437, 2016. a
Delgado-Torres, C., Donat, M. G., Gonzalez-Reviriego, N., Caron, L.-P., Athanasiadis, P. J., Bretonnière, P.-A., Dunstone, N. J., Ho, A.-C., Pankatz, K., Paxian, A., Pérez-Zanón, N., Samsó Cabré, M., Solaraju-Murali, B., Soret, A., and Doblas-Reyes, F. J.: Multi-model forecast quality assessment of CMIP6 decadal predictions, EGU General Assembly 2022, Vienna, Austria, 23–27 May 2022, EGU22-13156, https://doi.org/10.5194/egusphere-egu22-13156, 2022. a, b, c
Dunstone, N., Lockwood, J., Solaraju-Murali, B., Reinhardt, K., Tsartsali, E. E., Athanasiadis, P. J., Bellucci, A., Brookshaw, A., Caron, L.-P., Doblas-Reyes, F. J., Früh, B., González-Reviriego, N., Gualdi, S., Hermanson, L., Materia, S., Nicodemou, A., Nicolì, D., Pankatz, K., Paxian, A., Scaife, A., Smith, D., and Thornton, H. E.: Towards Useful Decadal Climate Services, B. Am. Meteorol. Soc., 103, E1705–E1719, https://doi.org/10.1175/bams-d-21-0190.1, 2022. a
Eis, J., Van der Laan, L., Maussion, F., and Marzeion, B.: Reconstruction of past glacier changes with an ice-flow glacier model: Proof of concept and validation, Frontiers in Earth Science, 9, 595755, https://doi.org/10.3389/feart.2021.595755, 2021. a, b
Farinotti, D., Huss, M., Fürst, J. J., Landmann, J., Machguth, H., Maussion, F., and Pandit, A.: A consensus estimate for the ice thickness distribution of all glaciers on Earth, Nat. Geosci., 12, 168–173, https://doi.org/10.1038/s41561-019-0300-3, 2019. a
Farinotti, D., Immerzeel, W. W., de Kok, R. J., Quincey, D. J., and Dehecq, A.: Manifestations and Mechanisms of the Karakoram Glacier Anomaly, Nat. Geosci., 13, 8–16, https://doi.org/10.1038/s41561-019-0513-5, 2020. a
Frans, C., Istanbulluoglu, E., Lettenmaier, D. P., Clarke, G., Bohn, T. J., and Stumbaugh, M.: Implications of Decadal to Century Scale Glacio-hydrological Change for Water Resources of the Hood River Basin, OR, USA, Hydrol. Process., 30, 4314–4329, https://doi.org/10.1002/hyp.10872, 2016. a, b
Frederikse, T., Landerer, F., Caron, L., Adhikari, S., Parkes, D., Humphrey, V. W., Dangendorf, S., Hogarth, P., Zanna, L., Cheng, L., and Wu, Y.-H.: The causes of sea-level rise since 1900, Nature, 584, 393–397, https://doi.org/10.1038/s41586-020-2591-3, 2020. a, b
Förster, K. and van der Laan, L. N.: A review on observed historical changes in hydroclimatic extreme events over Europe, in: Climate Impacts on Extreme Weather, edited by: Ongoma, V. and Tabari, H., Elsevier, 131–144, ISBN 978-0-323-88456-3, https://doi.org/10.1016/B978-0-323-88456-3.00015-0, 2022. a
Förster, K., Hanzer, F., Stoll, E., Scaife, A. A., MacLachlan, C., Schöber, J., Huttenlau, M., Achleitner, S., and Strasser, U.: Retrospective forecasts of the upcoming winter season snow accumulation in the Inn headwaters (European Alps), Hydrol. Earth Syst. Sci., 22, 1157–1173, https://doi.org/10.5194/hess-22-1157-2018, 2018. a
García-Serrano, J., Guemas, V., and Doblas-Reyes, F.: Added-value from initialization in predictions of Atlantic multi-decadal variability, Clim. Dynam., 44, 2539–2555, https://doi.org/10.1007/s00382-014-2370-7, 2015. a
Goosse, H., Barriat, P.-Y., Dalaiden, Q., Klein, F., Marzeion, B., Maussion, F., Pelucchi, P., and Vlug, A.: Testing the consistency between changes in simulated climate and Alpine glacier length over the past millennium, Clim. Past, 14, 1119–1133, https://doi.org/10.5194/cp-14-1119-2018, 2018. a
GTN-G: GTN-G Glacier Regions. Global Terrestrial Network for Glaciers, https://doi.org/10.5904/gtng-glacreg-2023-0, https://doi.org/10.5904/gtng-glacreg-2023-07, 2023. a
Hargreaves, J. C.: Skill and uncertainty in climate models, WIREs Clim. Change, 1, 556–564, https://doi.org/10.1002/wcc.58, 2010. a
Harris, I., Osborn, T. J., Jones, P., and Lister, D.: Version 4 of the CRU TS Monthly High-resolution Gridded Multivariate Climate Dataset, Scientific Data, 7, 109, https://doi.org/10.1038/s41597-020-0453-3, 2020. a, b, c, d, e
Hermanson, L., Smith, D., Seabrook, M., Bilbao, R., Doblas-Reyes, F., Tourigny, E., Lapin, V., Kharin, V. V., Merryfield, W. J., Sospedra-Alfonso, R., Athanasiadis, P., Nicoli, D., Gualdi, S., Dunstone, N., Eade, R., Scaife, A., Collier, M., O’Kane, T., Kitsios, V., Sandery, P., Pankatz, K., Früh, B., Pohlmann, H., Müller, W., Kataoka, T., Tatebe, H., Ishii, M., Imada, Y., Kruschke, T., Koenigk, T., Karami, M. P., Yang, S., Tian, T., Zhang, L., Delworth, T., Yang, X., Zeng, F., Wang, Y., Counillon, F., Keenlyside, N., Bethke, I., Lean, J., Luterbacher, J., Kolli, R. K., and Kumar, A.: WMO global annual to decadal climate update: a prediction for 2021–25, B. Am. Meteorol. Soc., 103, E1117–E1129, https://doi.org/10.18356/9789210027939, 2022. a
Hersbach, H.: Decomposition of the continuous ranked probability score for ensemble prediction systems, Weather Forecast., 15, 559–570, https://doi.org/10.1175/1520-0434(2000)015<0559:DOTCRP>2.0.CO;2, 2000. a
Hock, R., Bliss, A., Marzeion, B. E. N., Giesen, R. H., Hirabayashi, Y., Huss, M., Radic, V., and Slangen, A. B.: GlacierMIP – A Model Intercomparison of Global-scale Glacier Mass-balance Models and Projections, J. Glaciol., 65, 453–467, https://doi.org/10.1017/jog.2019.22, 2019. a, b
Hossain, M. M., Garg, N., Anwar, A. F., Prakash, M., and Bari, M.: Intercomparison of drift correction alternatives for CMIP5 decadal precipitation, Int. J. Climatol., 42, 1015–1037, https://doi.org/10.1002/joc.7287, 2022. a
Hugonnet, R., McNabb, R., Berthier, E., Menounos, B., Nuth, C., Girod, L., Farinotti, D., Huss, M., Dussaillant, I., Brun, F., and Kääb, A.: Accelerated global glacier mass loss in the early twenty-first century, Nature, 592, 726–731, https://doi.org/10.1038/s41586-021-03436-z, 2021. a, b, c, d, e, f, g, h, i, j, k, l, m, n, o, p
Huss, M. and Hock, R.: Global-scale hydrological response to future glacier mass loss, Nat. Clim. Change, 8, 135–140, https://doi.org/10.1038/s41558-017-0049-x, 2018. a
Huston, A., Siler, N., Roe, G. H., Pettit, E., and Steiger, N. J.: Understanding drivers of glacier-length variability over the last millennium, The Cryosphere, 15, 1645–1662, https://doi.org/10.5194/tc-15-1645-2021, 2021. a
Immerzeel, W. W., Lutz, A. F., Andrade, M., Bahl, A., Biemans, H., Bolch, T., Hyde, S., Brumby, S., Davies, B., Elmore, A., Emmer, A., Feng, M., Fernández, A., Haritashya, U., Kargel, J. S., Koppes, M., Kraaijenbrink, P. D. A., Kulkarni, A. V., Mayewski, P. A., Nepal, S., Pacheco, P., Painter, T. H., Pellicciotti, F., Rajaram, H., Rupper, S., Sinisalo, A., Shrestha, A. B., Viviroli, D., Wada, Y., Xiao, C., Yao, T., and Baillie, J. E. M.: Importance and vulnerability of the world's water towers, Nature, 577, 364–369, https://doi.org/10.1038/s41586-019-1822-y, 2020. a
Jansson, P., Hock, R., and Schneider, T.: The Concept of Glacier Storage: a Review, J. Hydrol., 282, 116–129, https://doi.org/10.1016/S0022-1694(03)00258-0, 2003. a, b
Kadow, C., Illing, S., Kröner, I., Ulbrich, U., and Cubasch, U.: Decadal climate predictions improved by ocean ensemble dispersion filtering, J. Adv. Model. Earth Sy., 9, 1138–1149, https://doi.org/10.1002/2016ms000787, 2017. a, b
Kataoka, T., Tatebe, H., Koyama, H., Mochizuki, T., Ogochi, K., Naoe, H., Imada, Y., Shiogama, H., Kimoto, M., and Watanabe, M.: Seasonal to Decadal Predictions with MIROC6: Description and Basic Evaluation, J. Adv. Model. Earth Sy., 12, e2019MS002035, https://doi.org/10.1029/2019MS002035, 2020. a
Kharin, V. V., Boer, G. J., Merryfield, W. J., Scinocca, J. F., and Lee, W. S.: Statistical Adjustment of Decadal Predictions in a Changing Climate, Geophys. Res. Lett., 39, L19705, https://doi.org/10.1029/2012GL052647, 2012. a, b
Kiem, A. S. and Verdon-Kidd, D. C.: Steps Toward “Useful” Hydroclimatic Scenarios for Water Resource Management in the Murray-Darling Basin, Water Resour. Res., 47, W00G06, https://doi.org/10.1029/2010wr009803, 2011. a
Koziol, C. P. and Arnold, N.: Modelling seasonal meltwater forcing of the velocity of land-terminating margins of the Greenland Ice Sheet, The Cryosphere, 12, 971–991, https://doi.org/10.5194/tc-12-971-2018, 2018. a
Kushnir, Y., Scaife, A. A., Arritt, R., Balsamo, G., Boer, G., Doblas-Reyes, F., Hawkins, E., Kimoto, M., Kolli, R. K., Kumar, A., Matei, D., Matthes, K., Müller, W. A., O'Kane, T., Perlwitz, J., Power, S., Raphael, M., Shimpo, A., Smith, D., Tuma, M., and Wu, B.: Towards operational predictions of the near-term climate, Nat. Clim. Change, 9, 94–101, https://doi.org/10.1038/s41558-018-0359-7, 2019. a
Lane, S. N. and Nienow, P. W.: Decadal‐Scale Climate Forcing of Alpine Glacial Hydrological Systems., Water Resour. Res., 55, 2478–2492, https://doi.org/10.1029/2018WR024206, 2019. a, b
Malles, J.-H. and Marzeion, B.: Twentieth century global glacier mass change: an ensemble-based model reconstruction, The Cryosphere, 15, 3135–3157, https://doi.org/10.5194/tc-15-3135-2021, 2021. a
Manzanas, R.: Assessment of model drifts in seasonal forecasting: Sensitivity to ensemble size and implications for bias correction, J. Adv. Model. Earth Sy., 12, e2019MS001751, https://doi.org/10.1029/2019ms001751, 2020. a
Marzeion, B., Jarosch, A. H., and Hofer, M.: Past and future sea-level change from the surface mass balance of glaciers, The Cryosphere, 6, 1295–1322, https://doi.org/10.5194/tc-6-1295-2012, 2012. a, b
Marzeion, B., Champollion, N., Haeberli, W., Langley, K., Leclercq, P., and Paul, F.: Observation-based estimates of global glacier mass change and its contribution to sea-level change, Surv. Geophys., 38, 107–132, https://doi.org/10.1007/s10712-016-9394-y, 2017. a
Maussion, F., Butenko, A., Champollion, N., Dusch, M., Eis, J., Fourteau, K., Gregor, P., Jarosch, A. H., Landmann, J., Oesterle, F., Recinos, B., Rothenpieler, T., Vlug, A., Wild, C. T., and Marzeion, B.: The Open Global Glacier Model (OGGM) v1.1, Geosci. Model Dev., 12, 909–931, https://doi.org/10.5194/gmd-12-909-2019, 2019. a, b, c, d, e, f, g
Maussion, F., Rothenspieler, T., Dusch, M., Vlug, A., Schuster, L., Schmitt, P., Champollion, N., Marzeion, B., Li, F., Oberrauch, M., Landmann, J., Eis, J., Jarosch, A., Hanus, S., Rounce, D., Castellani, M., Bartholomew, S., Merrill, C., Loibl, D., Ultee, L., Minallah, S., Thompson, S., and Ub, A. Gregor, P.: OGGM/oggm: v1.5.3, Zenodo [code], https://doi.org/10.5281/zenodo.6408559, 2022. a
Merryfield, W. J., Baehr, J., Batté, L., et al.: Current and emerging developments in subseasonal to decadal prediction, B. Am. Meteorol. Soc., 101, E869–E896, https://doi.org/10.1175/BAMS-D-19-0037.1, 2020. a
New, M., Lister, D., Hulme, M., and Makin, I.: A high-resolution data set of surface climate over global land areas, Clim. Res., 21, 1–25, https://doi.org/10.3354/cr021001, 2002. a, b, c
Nicolì, D., Gualdi, S., and Athanasiadis, P.: Decadal predictions outperform climate projections in forecasting Mediterranean wintertime precipitation, Environ. Res. Lett., 20, 034034, https://doi.org/10.1088/1748-9326/adb59e, 2025. a
O'Kane, T. J., Scaife, A. A., Kushnir, Y., Brookshaw, A., Buontempo, C., Carlin, D., Connell, R. K., Doblas-Reyes, F., Dunstone, N., Förster, K., Graça, A., Hobday, A. J., Kitsios, V., van der Laan, L., Lockwood, J., Merryfield, W. J., Paxian, A., Payne, M. R., Reader, M. C., Saville, G. R., Smith, D., Solaraju-Murali, B., Caltabiano, N., Carman, J., Hawkins, E., Keenlyside, N., Kumar, A., Matei, D., Pohlmann, H., Power, S., Raphael, M., Sparrow, M., and Wu, B.: Recent applications and potential of near-term (interannual to decadal) climate predictions, Frontiers in Climate, 5, https://doi.org/10.3389/fclim.2023.1121626, 2023. a, b
Pasternack, A., Grieger, J., Rust, H. W., and Ulbrich, U.: Recalibrating decadal climate predictions – what is an adequate model for the drift?, Geosci. Model Dev., 14, 4335–4355, https://doi.org/10.5194/gmd-14-4335-2021, 2021. a
Payne, M. R., Danabasoglu, G., Keenlyside, N., Matei, D., Miesner, A. K., Yang, S., and Yeager, S. G.: Skilful decadal-scale prediction of fish habitat and distribution shifts, Nat. Commun., 13, 2660, https://doi.org/10.1038/s41467-022-30280-0, 2022. a, b, c
Pfeffer, W. T., Arendt, A. A., Bliss, A., Bolch, T., Cogley, J. G., Gardner, A. S., Hagen, J.-O., Hock, R., Kaser, G., Kienholz, C., and The Randolph Consortium: The Randolph Glacier Inventory: A Globally Complete Inventory of Glaciers, J. Glaciol., 60, 537–552, https://doi.org/10.3189/2014JoG13J176, 2014. a
Ramírez Villegas, J. and Jarvis, A.: Downscaling global circulation model outputs: the delta method decision and policy analysis Working Paper No. 1, https://hdl.handle.net/10568/90731 (last access: 1 June 2024), 2010. a
Recinos, B., Maussion, F., Rothenpieler, T., and Marzeion, B.: Impact of frontal ablation on the ice thickness estimation of marine-terminating glaciers in Alaska, The Cryosphere, 13, 2657–2672, https://doi.org/10.5194/tc-13-2657-2019, 2019. a
RGI Consortium: Randolph Glacier Inventory (RGI) – A Dataset of Global Glacier Outlines: Version 6.0, Global Land Ice Measurements from Space [data set], https://doi.org/10.7265/N5-RGI-60, 2017. a, b
Risbey, J. S., Squire, D. T., Black, A. S., DelSole, T., Lepore, C., Matear, R. J., Monselesan, D. P., Moore, T. S., Richardson, D., Schepen, A., Tippett, M. K., and Tozer, C. R.: Standard assessments of climate forecast skill can be misleading, Nat. Commun., 12, 4346, https://doi.org/10.1038/s41467-021-23771-z, 2021. a
Roe, G. H., Christian, J. E., and Marzeion, B.: On the attribution of industrial-era glacier mass loss to anthropogenic climate change, The Cryosphere, 15, 1889–1905, https://doi.org/10.5194/tc-15-1889-2021, 2021. a
Rounce, D. R., Hock, R., Maussion, F., Hugonnet, R., Kochtitzky, W., Huss, M., Berthier, E., Brinkerhoff, D., Compagno, L., Copland, L., Farinotti, D., Menounos, B., and McNabb, R. W.: Global glacier change in the 21st century: Every increase in temperature matters, Science, 379, 78–83, https://doi.org/10.1126/science.abo1324, 2023. a, b
Réveillet, M., Six, D., Vincent, C., Rabatel, A., Dumont, M., Lafaysse, M., Morin, S., Vionnet, V., and Litt, M.: Relative performance of empirical and physical models in assessing the seasonal and annual glacier surface mass balance of Saint-Sorlin Glacier (French Alps), The Cryosphere, 12, 1367–1386, https://doi.org/10.5194/tc-12-1367-2018, 2018. a
Siegert, S., Bellprat, O., Ménégoz, M., Stephenson, D. B., and Doblas-Reyes, F. J.: Detecting improvements in forecast correlation skill: Statistical testing and power analysis, Mon. Weather Rev., 145, 437–450, https://doi.org/10.1175/mwr-d-16-0037.1, 2017. a
Slangen, A., Adloff, F., Jevrejeva, S., Leclercq, P., Marzeion, B., Wada, Y., and Winkelmann, R.: A review of recent updates of sea-level projections at global and regional scales, Integrative Study of the Mean Sea level and Its Components, in: Integrative Study of the Mean Sea Level and Its Components. Space Sciences Series of ISSI, vol 58, edited by: Cazenave, A., Champollion, N., Paul, F., and Benveniste, J., Springer, Cham, 395–416, https://doi.org/10.1007/978-3-319-56490-6_17, 2017. a, b
Smith, D., Eade, R., Scaife, A., Caron, L.-P., Danabasoglu, G., DelSole, T., Delworth, T., Doblas-Reyes, F., Dunstone, N., Hermanson, L., Kharin, V., Kimoto, M., Merryfield, W. J., Mochizuki, T., Müller, W. A., Pohlmann, H., Yeager, S., and Yang, X.: Robust skill of decadal climate predictions, Npj Climate and Atmospheric Science, 2, 13, https://doi.org/10.1038/s41612-019-0071-y, 2019. a, b, c, d
Smith, D. M., Eade, R., and Pohlmann, H.: A comparison of full-field and anomaly initialization for seasonal to decadal climate prediction, Clim. Dynam., 41, 3325–3338, https://doi.org/10.1007/s00382-013-1683-2, 2013. a
Solaraju-Murali, B., Bojovic, D., Gonzalez-Reviriego, N., Nicodemou, A., Terrado, M., Caron, L. P., and Doblas-Reyes, F. J.: How Decadal Predictions Entered the Climate Services Arena: An Example from the Agriculture Sector, Climate Services, 27, 100303, https://doi.org/10.1016/j.cliser.2022.100303, 2022. a
Strobach, E. and Bel, G.: The contribution of internal and model variabilities to the uncertainty in CMIP5 decadal climate predictions, Clim. Dynam., 49, 3221–3235, https://doi.org/10.1007/s00382-016-3507-7, 2017. a
Tatebe, H., Ogura, T., Nitta, T., Komuro, Y., Ogochi, K., Takemura, T., Sudo, K., Sekiguchi, M., Abe, M., Saito, F., Chikira, M., Watanabe, S., Mori, M., Hirota, N., Kawatani, Y., Mochizuki, T., Yoshimura, K., Takata, K., O'ishi, R., Yamazaki, D., Suzuki, T., Kurogi, M., Kataoka, T., Watanabe, M., and Kimoto, M.: Description and basic evaluation of simulated mean state, internal variability, and climate sensitivity in MIROC6, Geosci. Model Dev., 12, 2727–2765, https://doi.org/10.5194/gmd-12-2727-2019, 2019. a
Thornton, P. K., Ericksen, P. J., Herrero, M., and Challinor, A. J.: Climate variability and vulnerability to climate change: a review, Glob. Change Biol., 20, 3313–3328, https://doi.org/10.1111/gcb.12581, 2014. a
Ultee, L., Coats, S., and Mackay, J.: Glacial runoff buffers droughts through the 21st century, Earth Syst. Dynam., 13, 935–959, https://doi.org/10.5194/esd-13-935-2022, 2022. a
Vargo, L. J., Anderson, B. M., Dadić, R., Horgan, H. J., Mackintosh, A. N., King, A. D., and Lorrey, A. M.: Anthropogenic warming forces extreme annual glacier mass loss, Nat. Clim. Change, 10, 856–861, https://doi.org/10.1038/s41558-020-0849-2, 2020. a
WGMS: Reference Glaciers for Mass Balance, WGMS [data set], https://wgms.ch/products_ref_glaciers/, last access: 1 December 2022. a
Zekollari, H., Huss, M., and Farinotti, D.: On the Imbalance and Response Time of Glaciers in the European Alps, Geophys. Res. Lett., 47, e2019GL085578, https://doi.org/10.1029/2019GL085578, 2020. a
Zekollari, H., Huss, M., Farinotti, D., and Lhermitte, S.: Ice-Dynamical Glacier Evolution Modeling – A Review, Rev. Geophys., 60, e2021RG000754, https://doi.org/10.1029/2021rg000754, 2022. a
Zekollari, H., Huss, M., Schuster, L., Maussion, F., Rounce, D. R., Aguayo, R., Champollion, N., Compagno, L., Hugonnet, R., Marzeion, B., Mojtabavi, S., and Farinotti, D.: Twenty-first century global glacier evolution under CMIP6 scenarios and the role of glacier-specific observations, The Cryosphere, 18, 5045–5066, https://doi.org/10.5194/tc-18-5045-2024, 2024. a
Zhou, T., Wang, B., Yu, Y.-Q., Liu, Y., Zheng, W., Li, L., Wu, B., Lin, P., Guo, Z., Man, W., Bao, Q., Duan, A., Liu, H., Chen, X., He, B., Li, J., Zou, L., Wang, X., Zhang, L., and Zhang, W.: The FGOALS Climate System Model as a Modeling Tool for Supporting Climate Sciences: An Overview, Earth and Planetary Physics, 2, 276–291, 2018. a
Zhu, E., Yuan, X., and Wood, A. W.: Benchmark decadal forecast skill for terrestrial water storage estimated by an elasticity framework, Nat. Commun., 10, 1237, https://doi.org/10.1038/s41467-019-09245-3, 2019. a