Articles | Volume 18, issue 11
https://doi.org/10.5194/tc-18-4955-2024
https://doi.org/10.5194/tc-18-4955-2024
Research article
 | 
04 Nov 2024
Research article |  | 04 Nov 2024

A simple snow temperature index model exposes discrepancies between reanalysis snow water equivalent products

Aleksandra Elias Chereque, Paul J. Kushner, Lawrence Mudryk, Chris Derksen, and Colleen Mortimer
Abstract

Current global reanalyses show marked discrepancies in snow mass and snow cover extent for the Northern Hemisphere. Here, benchmark snow datasets are produced by driving a simple offline snow model, the Brown Temperature Index Model (B-TIM), with temperature and precipitation from three reanalyses. The B-TIM offline snow performs comparably to or better than online (coupled land–atmosphere) reanalysis snow when evaluated against in situ snow measurements. Sources of discrepancy in snow climatologies, which are difficult to isolate when comparing online reanalysis snow products amongst themselves, are partially elucidated by separately bias-adjusting temperature and precipitation in the B-TIM. Interannual variability in snow mass and snow spatial patterns is far more self-consistent amongst offline B-TIM snow products than amongst online reanalysis snow products, and the self-consistent products are more similar to in situ observations, as evaluated in a validation study. Specific artifacts related to temporal inhomogeneity in snow data assimilation are revealed in the analysis. The B-TIM, released here as an open-source, self-contained Python package, provides a simple benchmarking tool for future updates to more sophisticated online and offline snow datasets.

1 Introduction

Terrestrial snow is a highly variable component of the cryosphere that responds to and feeds back on anthropogenic global warming via snow albedo (e.g., Betts et al., 2014; Thackeray et al., 2018). At its maximum, snow covers up to 50 % of the Northern Hemisphere land surface (Robinson and Frei, 2000), and it controls a wide range of hydrological, ecological, and socio-economic systems (Bokhorst et al., 2016). Snow variability and trends have been monitored over several decades (Doesken and Judson, 1997; Mudryk et al., 2020; Robinson, 1989) with regular reporting, such as in the annual National Oceanic and Atmospheric Administration (NOAA) Arctic Report Card. Despite this attention to snow, there are marked discrepancies in historical snow estimates from available products, leading to gaps in our understanding of snow across a range of spatial scales, from point to watershed to hemispheric (Magnusson et al., 2015; Mudryk et al., 2015). Many factors lead to these discrepancies, making it a challenge to identify a single authoritative dataset for historical snow water equivalent or related variables. Furthermore, the simplest snow models can perform comparably to the most complex snow models in relation to the available in situ observations (Boone and Etchevers, 2001; Essery et al., 2013; Magnusson et al., 2015; Menard et al., 2021). For this reason, “offline” datasets generated with temperature index models (TIMs), snow models forced only by air temperature and precipitation that do not represent coupling of snow to the land–atmosphere system, are still maintained (e.g., Hock, 2003; Ohmura, 2001; Sturm, 2015; Walter et al., 2005). Recent studies have advocated for the use of multi-product ensembles spanning a range of complexities (including offline snow models, land surface data assimilation systems, and coupled atmosphere–land reanalysis systems) and a range of snow schemes from single-layer to multilayer snow modules embedded inside comprehensive land surface models. These ensembles can then be used to characterize snow climatology and trends (e.g., Mudryk et al., 2024), evaluate new snow datasets, or quantify uncertainties (Essery, 2015; Kim et al., 2021; Mudryk et al., 2015). Methods to evaluate the quality of potential ensemble members are actively being explored.

In this study, we use an offline TIM to investigate the discrepancies in snow water equivalent (SWE) and snow cover extent (SCE) in online reanalysis snow products. We use an updated version of the Brown et al. (2003) TIM, hereafter called the B-TIM (Brown Temperature Index Model), whose broad applicability and extensive legacy at Environment and Climate Change Canada (ECCC) motivate its use. The model was initially developed to provide a first-guess field for a gridded snow analysis using forcing from the European Centre for Medium-range Weather Forecasts (ECMWF) Reanalysis 15 (ERA-15). The snow analysis was used to evaluate global climate model output from AMIP II. Later, using forcing from numerical weather forecasts to run the B-TIM, the model was incorporated into the Canadian Meteorological Centre's (CMC's) daily snow depth analysis (Brown and Brasnett, 2010). This dataset continues to be used as a validation product for other studies (Kim et al., 2021; Zhang et al., 2014). Until recently, the standalone version of the B-TIM that is internally available at ECCC was coded in Fortran 77 and forced with temperature and precipitation forcing from ERA-Interim (Dee et al., 2011). The ERA-Interim version participated in several ensemble studies (e.g., Brown et al., 2010; Brown and Robinson, 2011; Mortimer et al., 2020) and provided hemispheric snow mass estimates for the NOAA Arctic Report Card (2017 edition to 2020 edition; e.g., Mudryk et al., 2020) but has been superseded by a version forced with ERA5. In addition to updated forcing, following ECCC's push to provide transparent and reproducible open-source climate assessment tools based on FAIR (findable, accessible, interoperable, and reusable) principles (Environment and Climate Change Canada, 2021), we are motivated to release the B-TIM as an open-source code following updated coding standards.

Discrepancies in snow from “online” coupled reanalyses arise from many sources, including inconsistencies in terms of snow data assimilation schemes, underlying snow and land–surface component model differences, atmospheric model differences, differences in processes governing the coupled surface energy balance, and interactions between all these factors. To highlight one example which we will discuss in more detail below, while the assimilation of snow data may improve instantaneous estimates of snow depth, there is evidence that significant time series discontinuities may result as contributions to the data stream change (as in ERA5; Mortimer et al., 2020). Like any offline TIM, the B-TIM does not assimilate snow data, does not capture surface energetics, and features no coupling between snow and the land–atmosphere state. The B-TIM offline snow products, provided they are suitably validated, can thus isolate the role of meteorological driving from issues related to data assimilation, model bias, and errors arising from coupling, all of which can be sources of discrepancy for more sophisticated snow datasets. In this work, we use a fixed version of the B-TIM without further calibration or tuning. Therefore, one parameter set for the model is used, and the results may still contain model bias. Quantifying this bias for the B-TIM can be done through the analysis of parameter and error sensitivity (Essery, 2015; Raleigh et al., 2015). However, our aim is to investigate reanalysis snow biases; each offline snow product will have the same model bias, whereas the coupled reanalysis snow does not. Comparing offline snow products therefore narrows down the sources of discrepancy without requiring a re-run of the complex snow modelling and data assimilation process.

We document an updated B-TIM algorithm (Sect. 2), which we release here as an open-source, self-contained Python repository. We then use the B-TIM to generate offline SWE and snow cover extent using temperature and precipitation forcing from the global reanalyses ERA5, JRA-55, and MERRA-2 for 1980–2020. Through validation with in situ data, we compare the realism of the offline B-TIM and online coupled reanalyses (Sect. 3). This study has as its focus hemispheric snow. Even at these large scales and excluding complexities tied to mountain snow modelling, there are discrepancies that should be characterized. For exploration into regional performance, two other studies have been prepared for publication: Mudryk et al. (2024) and Mortimer et al. (2024). These include all the datasets discussed here. Mudryk et al. (2024) evaluate a suite of 23 gridded SWE products, ranking them by performance and inter-dataset consistency. Mortimer et al. (2024) present an expanded reference SWE dataset that combines in situ and airborne SWE measurement and assess snow dataset performance against that record. The same in situ data are used for this study. Our main scientific work here, which is described in Sect. 3, will be to use the B-TIM to characterize and explain discrepancies amongst online reanalysis snow products' climatological characteristics and interannual variabilities. This analysis will include the use of bias-adjusted temperature and precipitation forcing in the B-TIM to elucidate sources of discrepancy. We discuss results and conclusions in Sect. 4.

2 Data and methods

2.1 The B-TIM snow model

The calculations described in this section, which can also be seen in the schematic in Fig. 1, comprise version 1.0.0 of the B-TIM (https://doi.org/10.5281/zenodo.10044950; Elias Chereque et al., 2024). This is the first updated description in this model's 2 decades of usage in many publications and applications. Relative to Brown et al. (2003), we provide updated documentation and changes to some constants, as reflected in the code and its parameters. Physical constants and parameter values can be found in Table S1 in the Supplement. At a given time step, we denote the initial snow depth and density by Di and ρi. SWEi is the initial time step's snow water equivalent, calculated as SWEi= ρiDi (with units of kg m−2). All densities have standard units (kg m−3), and snow depths have units of metres.

https://tc.copernicus.org/articles/18/4955/2024/tc-18-4955-2024-f01

Figure 1Conceptual overview of the Brown Temperature Index Model (B-TIM). At every time step and location, temperature and precipitation values are used to compute either the density and depth of any new snow or the temperature of any rainfall. The snowpack state (snow depth and density) is affected by rain melt, melting due to air temperature, and one of two densification processes which cause both depth and density variables to evolve.

Download

2.1.1 Initialization, meteorological driving, and time stepping

Each simulated snow year is initialized from snow-free conditions on 1 August and runs until the following 31 July; 2 m temperature and total precipitation (frozen and solid) are the only inputs to the model. The specific variables we used from each reanalysis are listed in Table S2. A fixed 20 % precipitation reduction is implemented at each model time step as a general loss parameter – this captures canopy interception, sublimation, and blowing snow for frozen precipitation. The variable P represents the reduced precipitation in metres of water for a given time step and location. The model time step is 1 h, but less frequent driving data can be handled. If needed, the model linearly interpolates temperature to hourly steps and divides accumulated precipitation by the duration of the driving-data time step in hours.

2.1.2 Determining precipitation phase

At each model time step, the precipitation phase is classified as snow or rain using a 0 °C threshold. Previous B-TIM applications allowed mixed precipitation between 0 and 2 °C following a linear relationship for the liquid fraction. For large-scale studies, there is little advantage to including mixed precipitation according to the linear relationship as opposed to a fixed threshold as both are coarse simplifications (Jennings and Molotch, 2019). The absence of mixed precipitation has a minimal impact on the aggregated variables, though it causes local differences in regions with ephemeral snow.

2.1.3 Updating snow depth and density

Following Hedstrom and Pomeroy (1998), frozen precipitation during a time step is assigned a “new-snow” density:

(1) ρ new = A + Be T / C T < 0 ° C ,

where T is the air temperature (values of the constants are listed in Table S1).

Intermediate values for snow depth and density are assigned to the model's single snow layer.

(2a)D=Di+Pρwρnew(2b)ρ=Diρi+PρwD

Three densification or melting steps are then applied to evolve ρ and D.

  1. Snowmelt is computed at each model time step using a melt factor, γ (mm w.e. K−1 h−1), which is based on the intermediate snow layer density, ρ. The relationship used to calculate γ is based on Kuusisto (1984):

    (3) γ = M 1 ρ - M 2 .

    Lower and upper bounds of 4.1×10-3 and 0.23, respectively, are enforced on γ. Hourly melt, represented as the change in snow depth ΔDm, follows a standard temperature index approach:

    (4) Δ D m = - T - T melt ρ γ , T > T melt 0 , T T melt ,

    where Tmelt=-1 °C is the threshold air temperature used for snowmelt.

    The leading coefficient in Eq. (3), M1, has been halved relative to Brown et al. (2003) to reduce the rate of snowmelt during the ablation season. This has been implemented for the CMC snow product.

  2. Snowmelt caused by rainfall on the snowpack is computed using

    (5) Δ D r = - R ρ w C w ( T w - T freeze ) L f ρ .

    Cw is the heat capacity of water (J kg−1 K−1), R is the total rainfall (m), Tw is the rainfall temperature (°C), ρw is the density of liquid water, and Lf is the latent heat of fusion for ice (J kg−1). Rain temperature is taken to be equal to air temperature, as in Brown et al. (2003), and the snowpack is assumed to be isothermal and 0 °C, implying instant melting of the snowpack when it is warmed.

  3. A second intermediate snow depth is computed based on these first two steps.

    (6) D = D + Δ D m + Δ D r

    Depending on air temperature, one of two possible densification processes is implemented. Both processes initially affect density (Eqs. 7 and 9), and then snow depth is adjusted to conserve total water.

    Cold. When temperatures are below Tmelt, cold-snow ageing is implemented as follows:

    (7) Δ ρ c = C 1 SWE exp C 3 T melt - T snow exp - C 2 ρ , T < T melt .

    SWE is the snow water equivalent (kg m−2, calculated as the product ρD). C1 and C2 are empirically derived constants. This formulation was proposed in Anderson (1976), and the parameters used in the B-TIM are in the accepted range. Tsnow is the snow temperature, taken to be equal to air temperature in this step. Snowpack density is allowed to vary between 200 and 550 kg m−3. The densification process does not vary seasonally.

    Warm. When temperatures are above 1 °C, the snowpack undergoes a warm settling process, which increases the density more rapidly. A maximum density is first defined with dependence on the intermediate snow depth:

    (8) ρ max = W max - W 1 D 1 - exp - D W 2 .

    This is then adjusted by the intermediate density:

    (9) Δ ρ w = ρ max - ρ 1 - e - a Δ t , T - 1 ° C .

    The value of a is such that in, one model time step (Δt=3600 s), the density difference is adjusted by 1 % of ρmax-ρ, which constitutes a change in density of a few percent for typical values of ρ.

    The final density is calculated as ρf=ρ+Δρw,T-1 °C, else ρf=ρ+Δρc, and the final depth is calculated after the densification process in the following manner to conserve water:

    (10) D f = D ρ ρ f .

The final snow depth and density values are carried to the next model time step, and new meteorological forcing is read in. The values of the prognostic variables are recorded at daily frequency and saved in monthly files. Annual total SWE and maximum SWE are tracked over the model year, and the values are saved at the end of the run.

2.2 Reanalysis products

In this work, we use three current-generation reanalyses which produce snow variables for 40 years or more for the Northern Hemisphere. We use the ECMWF Reanalysis version 5 (ERA5) (Dutra et al., 2012; Hersbach et al., 2020), the second-generation Modern-Era Retrospective analysis for Research and Applications from the National Aeronautics and Space Administration (MERRA-2) (Gelaro et al., 2017; Reichle et al., 2017), and the Japanese Meteorological Agency's 55-year Reanalysis (JRA-55) (Kobayashi et al., 2015). These products differ from one another with respect to data assimilation schemes, as well as in terms of their component atmospheric and land models. All three global reanalyses assimilate conventional atmospheric measurements, but ERA5 and JRA-55 additionally assimilate snow depth observations and satellite-derived snow extent information.

The different techniques used to constrain ERA5 and JRA-55 SWE using snow cover observations are described below in more detail. Additional comparisons of the reanalyses are documented in Sect. S1 of the Supplement.

Beginning in 2004, ERA5 assimilates the Interactive Multisensor Snow and Ice Mapping System (IMS) snow cover product wherever the model first guess indicates snow-free conditions (de Rosnay et al., 2015). In the IMS snow cover product, grid cells are either snow-covered or snow-free. Snow-free observations are treated as observations of 0 cm snow depth, while observations of full snow cover are treated as 5 cm of snow depth. These observations, together with the in situ snow depth measurements, enter the 2D Optimal Interpolation (2D-OI) scheme to update the snow depth. The inclusion of IMS snow cover in the data stream reduces the overall snow amounts and is associated with a discontinuity in ERA5 snow (Mortimer et al., 2020). We highlight this effect through comparison with ERA5Snow, a data product produced by an offline run of the ECMWF land model. ERA5Snow is produced with the same land surface data assimilation as ERA5, except for the IMS satellite snow product (de Rosnay, private access to data). It is distinct from the offline ERA5-Land product produced by ECMWF.

JRA-55 constrains snow using passive microwave observations from 1987 to the present, and climatological snow cover fills any gaps back to 1980. Though the microwave data processing methods are not fully documented in the peer-reviewed literature, Kobayashi et al. (2015) say the estimates of snow cover extent come from comparing brightness temperature at different frequencies (37 and 19 GHz at both horizontal and vertical polarization) to regionally and seasonally varying thresholds. All the snow is removed from grid cells where the land surface analysis indicates the presence of snow and the satellite observations do not. Snow is added to grid cells where the land surface analysis does not indicate snow but the satellite observations do. Unlike the fixed relationship between snow cover and snow depth used in ERA5, when the algorithm adds snow in JRA-55, it is a variable snow depth that would reduce land surface temperatures to freezing if it were to melt. Wherever the satellite and land surface analyses agree (both report snow-covered conditions or both report no-snow conditions), no adjustment is made.

2.3 Temperature and precipitation biases

Biases in temperature and precipitation directly impact both online and offline snow products, and our aim is to separately characterize their effects. Briefly comparing temperature and precipitation fields from the three reanalysis products, MERRA-2 exhibits the lowest hemispheric mean land temperatures for most of the year, and JRA-55 exhibits the highest (Fig. 2a). In the winter months, the JRA-55 mean temperature exceeds that of ERA5 by 2.15 K and that of MERRA-2 by nearly 3 K, with the largest temperature difference occurring in January. In addition to being the coldest on average, MERRA-2 has the largest land area capable of sustaining snow, diagnosed as regions with T<0 °C (Fig. 2b). This frozen land area exceeds that of ERA5 by 1 ×106 km2 or more during the shoulder seasons of autumn and spring.

With respect to total precipitation, JRA-55 is about 10 % wetter than the other two products across all months (Fig. 2c). MERRA-2 and ERA5 agree more closely, with differences of just 1 % in autumn and spring. ERA5 is 4 % wetter in the winter, and MERRA-2 is about 6 % wetter in the summer months. We investigate the roles of these forcing biases in SWE biases by implementing a simple climatological bias correction (method is described in Sect. S2).

https://tc.copernicus.org/articles/18/4955/2024/tc-18-4955-2024-f02

Figure 2Climatologies of mean temperature, frozen area, and total precipitation over Northern Hemisphere land areas, excluding mountains, computed twice monthly using 14 d windows centred on the 1st and 15th of each month.

Download

2.4 Topography, land mask, and regional definitions

Mountain regions are excluded from our analysis using a mask derived from the Global Earth Topography and Sea Surface Elevation at 30 arcsec resolution digital elevation model (GETASSE30 DEM). Locations in the DEM with local slopes greater than 2° are defined as mountainous. After coarsening the slope mask (to 0.25° × 0.25°, the ERA5 resolution), grid cells that are more than 95 % mountainous are recorded in a binary mountain mask file which is coarsened as needed using a nearest-neighbour algorithm (for the MERRA-2 or JRA-55 grids).

To define land grid cells, we use the land–sea masks associated with each reanalysis. The land fraction is used to scale grid cell land area when computing total snow mass, which depends on SWE and land area. When possible, computations are done on a dataset's native grid, and conservative regridding is applied to the SWE data, conserving total snow mass, before calculating grid-dependent metrics.

2.5 In situ validation of SWE datasets

We evaluate the SWE values from (offline) the B-TIM and (online) reanalysis by comparing them to a combined historical snow course and airborne gamma-derived SWE dataset. These data are independent from snow data assimilated in JRA-55 and ERA5. Snow course observations involve manual measurements of snow depth and density along a predefined transect, with measurements being averaged to obtain a single SWE value for each transect on a specific date (WMO, 2018). The measurement frequency for snow courses varies by jurisdiction, ranging from monthly measurements in Alaska, the western continental US, and most of Finland to measurements every 5 d during the spring snowmelt period in Russia. The Russian network has the highest sampling frequency and is well-distributed across the landscape, while dense networks with lower sampling frequencies are found in Finland, the northeastern US, and parts of southern Canada. Airborne-gamma SWE estimates are calculated by differencing snow-free and snow-covered measurements after accounting for background soil moisture. Flights are 15–20 km long with a 300 m wide footprint. Data are available for the United States and southern parts of some Canadian provinces. There is broad consistency between snow courses and airborne gamma observations (Mortimer et al., 2024), and so we are confident in using both types of information together to evaluate the two types of products.

Using the method in Mortimer et al. (2024), reference SWE data are matched in space and time with the gridded product data. Data are then spatially aggregated and summarized using bias, unbiased root mean squared error (uRMSE), and correlation. We compare data pairs for November through March for all years between 1980 and 2020, aiming to include as many measurements as possible before the snowmelt period. The validation is performed on non-mountainous points with non-zero SWE values below 500 mm that are simultaneously available for the reference data and all the estimates. The latter condition excludes some snow courses in coastal areas due to differing land, ice, and/or water masks and is consistent with our snow mass calculations. In this study, there is no spatial aggregation by land type.

3 Results

We compare snow from global reanalyses (ERA5, ERA5Snow, MERRA-2, and JRA-55) to snow from the offline B-TIM runs. The offline snow products are named BrE5, BrM2, and BrJ55, reflecting the use of distinct reanalysis meteorology for each version but the same B-TIM snow model to produce snow. Two of the reanalysis datasets, ERA5 and ERA5Snow, share the same temperature and precipitation inputs (“meteorology”). Therefore, there is only one BrE5 dataset produced. Standardizing through using a single model means that differences between the offline B-TIM runs primarily reflect differences in the forcing data.

3.1 The B-TIM compares well to in situ observations

Comparing modelled snow to in situ observations is one way to assess the realism and performance of each product. In general, we find a much broader spread in modelled SWE for high reference SWE values, showing overall decreasing model skill with increasing snow depth (Fig. 3a–f). Each scatterplot contains over 200 000 data points. In some products (JRA-55 and MERRA-2), there is a cluster of points where the modelled snow is shallow, but the reference SWE indicates deep snow. The B-TIM products have greater absolute bias than their respective reanalysis products, but they are of comparable magnitude. When mountain points are included, the B-TIM products have lower absolute bias than the reanalyses (not shown). Low bias does not necessarily mean good performance as individual positive and negative differences can cancel each other out. Of the reanalyses, ERA5 and ERA5Snow have the lowest uRMSE and highest correlation compared to the reference values, and so they outperform JRA-55 and MERRA-2 overall. The RMSE (calculated as the bias and uRMSE added in quadrature) of each offline snow product is less than that of its reanalysis counterpart. By these measures, all three B-TIM products have comparable skill to ERA5 and ERA5Snow. Finally, unlike their reanalysis counterparts, BrJ55 and BrM2 do not display the cluster of false low snow values.

https://tc.copernicus.org/articles/18/4955/2024/tc-18-4955-2024-f03

Figure 3SWE product validation against snow course and gamma SWE measurements. Panels (a)(f) consist of scatterplots showing all valid data pairs (snow course, product) from November to March over 1980–2018. The scatterplots are coloured as heat maps to display the concentration of data points, which is highest where the colour is red. Summary statistics, including the bias, unbiased root mean squared error (uRMSE), and correlation, are included in the legend and are summarized in (g)(i).

Download

These validation results show two things. First, the offline products capture realistic snow patterns when compared to ground measurements, even in the context of snow from more complex coupled reanalyses. Second, we see that snow data assimilation does not guarantee skilful snow modelling by these measures. In particular, ERA5 or ERA5Snow and JRA-55 are both produced with snow cover data assimilation (see Sect. 2.2), but while the former two are the best-performing products, the latter performs poorly (with high uRMSE and low correlation) and struggles with both false low snow values and large overestimations relative to ground truth. MERRA-2 does not assimilate snow data but also performs moderately according to the comparison metrics. Additionally, model complexity does not guarantee skilful snow modelling. The offline products generated with the B-TIM, with neither snow data assimilation nor coupled interactions between snow and the land–atmosphere system, perform comparably to each other and to the relatively more complex ERA5 and ERA5Snow, despite differences in the forcing data.

3.2 Using the B-TIM to assess discrepancies between reanalysis snow products

3.2.1 Discrepancies in reanalysis snow climatologies are caused by forcing-data biases

Marked differences appear in the magnitude of total snow mass and snow-covered area for the products considered here. Among the online (the B-TIM) datasets, JRA-55 (BrJ55) has the highest peak snow mass, exceeding the maximum value of MERRA-2 (BrM2) by about 0.15 × 1015 kg (0.17 × 1015 kg) and the maximum value of ERA5 (BrE5) by 0.73 × 1015 kg (0.77 × 1015 kg), as seen in Fig. 4. The relative rankings of these products and the biases in the peak snow mass are closely reproduced by the offline model; since the offline model can reproduce the biases, we explore the possibility that they are directly caused by the forcing biases discussed in Sect. 2.3, which can equally affect both types of products. We test if these inter-product biases can be manipulated – in particular, minimized – by bias-correcting the meteorological fields used to drive the B-TIM. If biases in mean meteorological conditions are the primary source of snow bias, a correction toward more similar climatological conditions should yield more similar offline modelled snow. We implement a basic multiplicative correction for each month using climatological temperature and precipitation conditions (see the Supplement). Then, for each possible pair (e.g., ERA5 targeting MERRA-2 climatology), three experiments are run: one with adjusted temperature, one with adjusted precipitation, and one with both variables adjusted. This yields 18 datasets in addition to the three unadjusted B-TIM datasets, the three reanalysis datasets, and ERA5Snow.

https://tc.copernicus.org/articles/18/4955/2024/tc-18-4955-2024-f04

Figure 4(a) Snow mass climatology over Northern Hemisphere land, with grid cells exceeding 500 mm capped at 500 mm. (b) Snow-covered area climatology, calculated using areas of grid cells with more than 4 mm SWE.

Download

Comparing the bias-adjusted versions of BrE5 and BrM2 (Fig. 5) indicates that temperature biases are the main driver for the differences between ERA5 and MERRA-2 snow mass and snow cover shown in Fig. 4; in the experiments where the ERA5 and MERRA-2 temperature climatologies are bias-adjusted, the resulting snow fields are also much more similar. Precipitation biases play a smaller role, and correcting the precipitation modestly decreases the snow mass biases over the whole season. Snow-covered area is not very sensitive to the precipitation correction, though the best agreement in both cases comes from rescaling both variables. However, mean biases in forcing variables do not explain all the difference in SWE. For the pairs involving JRA-55 (Figs. S1 and S2), the precipitation correction improves the agreement between a dataset and a chosen target but not at the level observed for the ERA5–MERRA-2 pair; the temperature scaling sometimes degrades the agreement. However, JRA-55 is several degrees warmer and about 10 % wetter than the other two reanalyses on average over the region of interest (Fig. 2), constituting more substantial differences than those that exist between ERA5 and MERRA-2.

To summarize, the B-TIM products (BrE5, BrM2, BrJ55) retain the relative biases present in the reanalyses. Motivated by this, we have explored the potential use of bias correction on the meteorological forcing to elucidate the drivers of these snow biases or to correct them to the first order. This approach isolates a subset of drivers and gives insight into the dominant sources of snow biases but requires more refinement to explain biases (see discussion in Sect. 4) more fully.

3.2.2 The B-TIM versions of SWE fields show consistency in seasonal cycle and interannual variability

Aside from JRA-55, which has delayed snow accumulation but an early peak SWE, all the other datasets agree that the snow mass maximum occurs within a 2-week period centred on 15 March. For snow-covered area, all datasets except MERRA-2 peak during the 14 d period centred on 1 February; the MERRA-2 maximum occurs 2 weeks earlier. Thus, unlike the reanalyses, the B-TIM products provide more consistent descriptions of key snowpack climatology metrics.

Figure 6 shows the September–October–November (SON) mean snow mass time series, calculated over land regions from 40–90° N (excluding mountains), with the same 500 mm maximum imposed as before to exclude high SWE values over isolated grid cells. Figure 7 shows the December–January–February (DJF) time series of mean snow mass. In these figures, dashed lines are used for reanalysis snow, and the solid lines show offline snow. Even without detrending and removing the mean, it is clear that the solid lines are highly consistent with each other (for both continents and both seasons; panels a and c), while there is much more disagreement between reanalysis products. This highlights the role that factors other than forcing biases play in introducing inter-product differences. We quantify the consistency in the offline–offline and reanalysis–reanalysis pairs by calculating correlation coefficients after removing the least-squares linear fit (Fig. 8). Detrending by other methods yields similar results (e.g., using the Theil–Sen estimator, which is robust with respect to outliers and shifts to the start and end of the time series, not shown). Across all regions and all seasons, the B-TIM products are strongly correlated with one another (r>0.85), whereas the reanalysis r values are lower in general and greatly depend on the pair.

https://tc.copernicus.org/articles/18/4955/2024/tc-18-4955-2024-f05

Figure 5(a, b) NH snow mass and (c, d) snow cover extent differences for MERRA-2 and ERA5 calculated as original minus target. Each panel shows the difference between the original and target snow mass climatologies (black), and the coloured lines represent the datasets generated by adjusting temperature (pink), precipitation (blue), or both (yellow) to the target dataset's climatology.

Download

https://tc.copernicus.org/articles/18/4955/2024/tc-18-4955-2024-f06

Figure 6Time series of total snow mass for SON by continent. Lower row has linear trends removed.

Download

The reanalysis JRA-55 snow mass is unique, characterized by large decadal variations. Positive anomalies are most common from 1980 to 1994 and from 2010 to 2020, while negative anomalies occur from 1995 to 2009 (Fig. 7d). These inconsistencies are not as extreme over Eurasia as JRA-55 captures positive and negative anomalies that are mostly in agreement with the remaining datasets, but its variations have the greatest magnitude (e.g., 1991, 2014). The disagreement is substantial in terms of snow mass amount. Over North America, especially before 1995, the reanalysis JRA-55 dataset has as much as 50 % more snow mass than the other reanalyses. This behaviour is not present in BrJ55. Additional comparison with in situ data indicates that the version of JRA-55 that has less interannual variability (BrJ55, solid orange) also has a significantly lower RMSE and higher correlation with the in situ data than the native JRA-55 (dashed orange; Fig. S3).

We now return to consider the two versions of the ERA5 reanalysis: ERA5 and ERA5Snow (dashed, blue in two shades, Figs. 6 and 7). The two time series diverge due to a change in the snow cover extent data assimilation in 2004. The mean difference in DJF snow mass over North America between these two products is 5 times greater after 2004 compared to before 2004 (9×1013 and 1.8×1013 kg, respectively) and 3 times greater after 2004 for Eurasia (7.7×1013 kg compared with 2.4×1013 kg). This step change is problematic for trend and correlation assessments, and so we use ERA5Snow in Fig. 9 below. As an offline product, BrE5 does not display the step change in 2004.

https://tc.copernicus.org/articles/18/4955/2024/tc-18-4955-2024-f07

Figure 7Time series of total snow mass for DJF by continent. Lower row has linear trends removed.

Download

https://tc.copernicus.org/articles/18/4955/2024/tc-18-4955-2024-f08

Figure 8(a) Correlation coefficients for the B-TIM dataset pairs. Individual values are shown with black points, and the mean is represented by the height of the bar to summarize the group. Similar information is shown in (b) for the reanalysis dataset pairs.

Download

These two examples show that the B-TIM snow datasets can generate reasonably performing benchmark datasets which are useful to contrast against native snow data. The comparison between reanalysis and the offline product forced with the same meteorology can highlight spurious variability, as in the case of JRA-55, or can point to temporal inhomogeneities, as with ERA5.

The consistency found for the offline products extends to spatial patterns. The time series of the DJF spatial pattern correlation between dataset pairs is shown in Fig. 9, with SON values shown in Fig. S4. For both seasons, offline–offline pairs are the most consistent with each other (with the highest r values) despite different meteorological forcings. There is also evidence of spatial disagreement between some of the reanalysis products. Notably, JRA-55 is very different from all the other datasets. This can be seen with the ERA5Snow–JRA-55 and MERRA-2–JRA-55 pairs (different model, different forcing), which have the weakest spatial correlations, or with the BrJ55–JRA-55 pair (different model, same forcing), which has a much lower correlation compared to the other same-forcing pairs. We remind the reader that ERA5Snow is used instead of ERA5 as the reanalysis snow product with the aim of removing the discontinuity around 2004; the meteorology is the same in both products so ERA5Snow can be compared to BrE5. The pattern correlations appear to be stable across the 40-year period for all pairs, although those involving JRA-55 have larger year-to-year variability. Additionally, reanalysis and offline versions of JRA-55 snow have low spatial correlation across all seasons and both continents compared to the ERA5 and MERRA-2 (Fig. S5). Broadly, reanalysis and offline patterns are less similar over Eurasia for a given season, and the correlation decreases over the snow year.

https://tc.copernicus.org/articles/18/4955/2024/tc-18-4955-2024-f09

Figure 9Spatial correlations for DJF calculated between pairs of datasets with the same meteorology (a) and between pairs of similar type (b) (either offline–offline or reanalysis–reanalysis).

Download

4 Discussion and conclusions

We can summarize our key points as follows:

  • An updated and more complete description of the B-TIM offline snow model has been provided for the first time since 2003, accompanied by an open-source code release of the model implemented in Python.

  • The offline B-TIM snow generated using meteorological forcing from three reanalysis products has been validated against an independent set of in situ snow observations (Sect. 2.5). The offline products perform generally as well as (online coupled) reanalysis snow. Based on this result, datasets generated with the B-TIM are treated as reasonably performing benchmark estimates of historical snow and are used to investigate discrepancies in reanalysis SWE.

  • Compared to online reanalysis snow, the offline B-TIM snow yields far more consistent interannual variability for both aggregate and spatially resolved snow metrics. This suggests the potential utility of the B-TIM as an offline tool for simplified snow modelling in seasonal to decadal prediction systems and climate downscaling for impact analysis.

  • Climatological characteristics of the offline B-TIM snow are generally more consistent with one another for various measures than reanalysis snow despite differences in the meteorological forcing data. Using the B-TIM with bias-adjusted forcing, climatological SWE differences between ERA5 and MERRA-2 are found to primarily come from temperature biases (MERRA-2 is colder, resulting in more SWE throughout the snow season). Attribution of discrepancies in terms of JRA-55 with the other two reanalyses is not as straightforward, as we discuss next.

Offline modelling has allowed us to understand some of the components contributing to the spread in SWE estimates across these three reanalyses. In general, nonlinearities inherent to snow modelling mean that it is unclear how exactly meteorological biases will impact modelled SWE fields for both historical and modelled future snow conditions (Evan and Eisenman, 2021; Räisänen, 2023; Sospedra-Alfonso and Merryfield, 2017). Interpreting the causes of SWE differences is further complicated when comparing products produced using different snow models and different data assimilation schemes. In this sense, the B-TIM can easily generate simplified benchmark datasets (no data assimilation and a single, simple model) alongside more complex products of interest. Here, we have attempted to attribute climatological SWE biases to climatological meteorological biases by adjusting each of the two forcing variables and calculating the effect on the SWE. We have taken advantage of the B-TIM's speed, which has allowed us to perform many cross-tests.

Future work should continue developing the B-TIM through systematic testing of parameter values. For example, the spatial variability and sensitivity of the model to the 20 % precipitation loss have not recently been characterized. This type of work is possible due to the recent increases in the availability and quality of in situ SWE, snow depth, and snow density information (Vionnet et al., 2021) for validation. Forcing biases and parameter changes both strongly influence modelled snow (Cho et al., 2022; Essery, 2015; Günther et al., 2019; Menard et al., 2021), and they should be characterized for the B-TIM. However, offline modelling can broadly be seen as a tool to investigate snow biases in products where additional simulations are not feasible, as is the case for reanalysis.

The simple bias adjustment methodology we use requires more refinement to fully explain the biases. Large differences between a dataset and a chosen target may make the multiplicative scaling less suitable, for example, by changing the input variable distributions significantly. Additionally, other aspects that are not captured in mean conditions can influence SWE in models, such as the nature of the diurnal cycle in temperature and the distribution of precipitation intensity and/or duration (or a combination of the two). Multiplicative rescaling can affect these aspects when adjusting a dataset to a chosen target, with the greatest impact coming from adjusting both driving variables at once. These effects are most relevant at the shoulder seasons and for areas with ephemeral snow.

Using reference in situ data and inter-dataset consistency arguments, we have shown that terrestrial SWE taken directly from the JRA-55 reanalysis is problematic and should not be used for climate analysis. Unlike the BrJ55 product, which performs comparably to the BrE5 and BrM2 products, the reanalysis JRA-55 terrestrial snow product is the least accurate with respect to the in situ validation. Furthermore, the interannual variability of the JRA-55 snow mass anomaly time series (Figs. 5 and 6) and corresponding SWE field patterns (Fig. 7) differ greatly from all the other datasets. The strong performance of BrJ55 suggests that the problem with JRA-55 snow arises from the JRA-55 snow model and data assimilation.

Snow is a critical component of the climate system, influencing a range of environmental and societal processes. Accurate snow modelling is needed for applications that require a long time series (e.g., trend analysis) and the best instantaneous estimates of SWE (e.g., numerical weather prediction). Here, we have demonstrated the value of a simple model like the B-TIM in helping us assess new products against self-consistent benchmarks as they are released. These considerations will continue to be important as we look ahead to the next generation of global reanalyses, including the JMA Reanalysis for Three Quarters of a Century, which is now available (JRA-3Q; Kosaka et al., 2024), and ERA6 (Balsamo et al., 2023).

Code and data availability

ERA5 data were retrieved from the Copernicus Climate Data Store (single levels: https://doi.org/10.24381/cds.adbb2d47; Hersbach et al., 2023). ERA5Snow data are available on request from patricia.rosnay@ecmwf.int. JRA-55 data were retrieved from the NCAR Research Data Archive (all collections: https://doi.org/10.5065/D6HH6H41; Japan Meteorological Agency/Japan, 2023). MERRA-2 data were retrieved from the Goddard Space Flight Center Distributed Active Archive Center (GSFC DAAC).

The description of the combined snow reference dataset is in Mortimer et al. (2024).

Processed output from the B-TIM runs and reanalysis data, required to reproduce the figures, is archived at https://doi.org/10.5683/SP3/IV6SVJ (Elias Chereque, 2024a).

Snow output modelled by the B-TIM with all three forcings is also archived at the following links: BrE5 – https://doi.org/10.5683/SP3/HHIRBU (Elias Chereque, 2024b), BrM2 – https://doi.org/10.5683/SP3/C5I5HN (Elias Chereque, 2024c), and BrJ55 – https://doi.org/10.5683/SP3/X5QJ3P (Elias Chereque, 2024d).

Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/tc-18-4955-2024-supplement.

Author contributions

AEC updated the B-TIM code and generated the modelled snow data. In situ validation was done by CM with input from LM and CD. The snow data comparison method was developed jointly by AEC, PJK, LM, and CD and executed by AEC. AEC prepared the paper with contributions from all the co-authors.

Competing interests

At least one of the (co-)authors is a member of the editorial board of The Cryosphere. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The results contain modified Copernicus Climate Change Service information. Neither the European Commission nor ECMWF is responsible for any use that may be made of the Copernicus information or data it contains.

Financial support

This research has been supported by the Canadian Space Agency (grant no. 16SAUSSNOW).

Review statement

This paper was edited by Franziska Koch and reviewed by two anonymous referees.

References

Anderson, E. A.: A point energy and mass balance model of a snow cover, NOAA Technical Report NWS, United States National Weather Service, 172 pp., https://repository.library.noaa.gov/view/noaa/6392 (last access: 31 October 2024), 1976. 

Balsamo, G., Rabier, F., Balmaseda, M., Bauer, P., Brown, A., Dueben, P., English, S., McNally, T., Pappenberger, F., Sandu, I., Thepaut, J.-N., and Wedi, N.: Recent progress and outlook for the ECMWF Integrated Forecasting System, EGU General Assembly 2023, Vienna, Austria, 23–28 Apr 2023, EGU23-13110, https://doi.org/10.5194/egusphere-egu23-13110, 2023. 

Betts, A. K., Desjardins, R., Worth, D., Wang, S., and Li, J.: Coupling of winter climate transitions to snow and clouds over the Prairies, J. Geophys. Res.-Atmos., 119, 1118–1139, https://doi.org/10.1002/2013JD021168, 2014. 

Bokhorst, S., Pedersen, S. H., Brucker, L., Anisimov, O., Bjerke, J. W., Brown, R. D., Ehrich, D., Essery, R. L. H., Heilig, A., Ingvander, S., Johansson, C., Johansson, M., Jónsdóttir, I. S., Inga, N., Luojus, K., Macelloni, G., Mariash, H., McLennan, D., Rosqvist, G. N., Sato, A., Savela, H., Schneebeli, M., Sokolov, A., Sokratov, S. A., Terzago, S., Vikhamar-Schuler, D., Williamson, S., Qiu, Y., and Callaghan, T. V.: Changing Arctic snow cover: A review of recent developments and assessment of future needs for observations, modelling, and impacts, Ambio, 45, 516–537, https://doi.org/10.1007/s13280-016-0770-0, 2016. 

Boone, A. and Etchevers, P.: An Intercomparison of Three Snow Schemes of Varying Complexity Coupled to the Same Land Surface Model: Local-Scale Evaluation at an Alpine Site, J. Hydrometeorol., 2, 374–394, https://doi.org/10.1175/1525-7541(2001)002<0374:AIOTSS>2.0.CO;2, 2001. 

Brown, R. D. and Brasnett, B.: Canadian Meteorological Centre (CMC) Daily Snow Depth Analysis Data. NASA National Snow and Ice Data Center Distributed Active Archive Center [data set], https://doi.org/10.5067/W9FOYWH0EQZ3, 2010 (updated annually). 

Brown, R., Derksen, C., and Wang, L.: A multi-data set analysis of variability and change in Arctic spring snow cover extent, 1967–2008, J. Geophys. Res.-Atmos., 115, D16111, https://doi.org/10.1029/2010JD013975, 2010. 

Brown, R. D. and Robinson, D. A.: Northern Hemisphere spring snow cover variability and change over 1922–2010 including an assessment of uncertainty, The Cryosphere, 5, 219–229, https://doi.org/10.5194/tc-5-219-2011, 2011. 

Brown, R. D., Brasnett, B., and Robinson, D.: Gridded North American monthly snow depth and snow water equivalent for GCM evaluation, Atmos.-Ocean, 41, 1–14, https://doi.org/10.3137/ao.410101, 2003. 

Cho, E., Vuyovich, C. M., Kumar, S. V., Wrzesien, M. L., Kim, R. S., and Jacobs, J. M.: Precipitation biases and snow physics limitations drive the uncertainties in macroscale modeled snow water equivalent, Hydrol. Earth Syst. Sci., 26, 5721–5735, https://doi.org/10.5194/hess-26-5721-2022, 2022. 

Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P., Kobayashi, S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P., Bechtold, P., Beljaars, A. C. M., Van De Berg, L., Bidlot, J., Bormann, N., Delsol, C., Dragani, R., Fuentes, M., Geer, A. J., Haimberger, L., Healy, S. B., Hersbach, H., Hólm, E. V., Isaksen, L., Kållberg, P., Köhler, M., Matricardi, M., McNally, A. P., Monge-Sanz, B. M., Morcrette, J.-J., Park, B. -K., Peubey, C., De Rosnay, P., Tavolato, C., Thépaut, J.-N., and Vitart, F.: The ERA-Interim reanalysis: configuration and performance of the data assimilation system, Q. J. Roy. Meteor. Soc., 137, 553–597, https://doi.org/10.1002/qj.828, 2011. 

de Rosnay, P., Isaksen, L., and Dahoui, M.: Snow data assimilation at ECMWF, ECMWF, https://doi.org/10.21957/LKPXQ6X5, 2015. 

Doesken, N. J. and Judson, A.: The snow booklet: a guide to the science, climatology, and measurement of snow in the United States, 2nd Edn., Colorado Climate Center, Dept. of Atmospheric Science, Colorado State University, Fort Collins, CO, 86 pp., ISBN 0965105628, 1997. 

Dutra, E., Viterbo, P., Miranda, P. M. A., and Balsamo, G.: Complexity of Snow Schemes in a Climate Model and Its Impact on Surface Energy and Hydrology, J. Hydrometeorol., 13, 521–538, https://doi.org/10.1175/JHM-D-11-072.1, 2012. 

Elias Chereque, A.: Replication Data for: “A simple snow temperature index model exposes discrepancies between reanalysis snow water equivalent products” (Elias Chereque et al., 2024), Borealis [data set], https://doi.org/10.5683/SP3/IV6SVJ, 2024a. 

Elias Chereque, A.: B-TIM snow for ERA5, Borealis [data set], https://doi.org/10.5683/SP3/HHIRBU, 2024b. 

Elias Chereque, A.: B-TIM snow for MERRA2, Borealis [data set], https://doi.org/10.5683/SP3/C5I5HN, 2024c. 

Elias Chereque, A.: B-TIM snow for JRA55, Borealis [data set], https://doi.org/10.5683/SP3/X5QJ3P, 2024d. 

Elias Chereque, A., Kushner, P. J., Mudryk, L., and Derksen, C.: Brown Temperature Index Model, Zenodo [code], https://doi.org/10.5281/zenodo.10044950, 2023. 

Environment and Climate Change Canada: Environment and Climate Change Canada Open Science Action Plan, 2021–2026, https://publications.gc.ca/collections/collection_2021/eccc/En4-446-2021-eng.pdf (last access: 28 October 2024), 2021. 

Essery, R.: A factorial snowpack model (FSM 1.0), Geosci. Model Dev., 8, 3867–3876, https://doi.org/10.5194/gmd-8-3867-2015, 2015. 

Essery, R., Morin, S., Lejeune, Y., and B Ménard, C.: A comparison of 1701 snow models using observations from an alpine site, Adv. Water Resour., 55, 131–148, https://doi.org/10.1016/j.advwatres.2012.07.013, 2013. 

Evan, A. and Eisenman, I.: A mechanism for regional variations in snowpack melt under rising temperature, Nat. Clim. Change, 11, 326–330, https://doi.org/10.1038/s41558-021-00996-w, 2021. 

Gelaro, R., McCarty, W., Suárez, M. J., Todling, R., Molod, A., Takacs, L., Randles, C. A., Darmenov, A., Bosilovich, M. G., Reichle, R., Wargan, K., Coy, L., Cullather, R., Draper, C., Akella, S., Buchard, V., Conaty, A., Silva, A. M. da, Gu, W., Kim, G.-K., Koster, R., Lucchesi, R., Merkova, D., Nielsen, J. E., Partyka, G., Pawson, S., Putman, W., Rienecker, M., Schubert, S. D., Sienkiewicz, M., and Zhao, B.: The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2), J. Clim., 30, 5419–5454, https://doi.org/10.1175/JCLI-D-16-0758.1, 2017. 

Günther, D., Marke, T., Essery, R., and Strasser, U.: Uncertainties in Snowpack Simulations – Assessing the Impact of Model Structure, Parameter Choice, and Forcing Data Error on Point-Scale Energy Balance Snow Model Performance, Water Resour. Res., 55, 2779–2800, https://doi.org/10.1029/2018WR023403, 2019. 

Hedstrom, N. R. and Pomeroy, J. W.: Measurements and modelling of snow interception in the boreal forest, Hydrol. Process., 12, 1611–1625, https://doi.org/10.1002/(SICI)1099-1085(199808/09)12:10/11<1611::AID-HYP684>3.0.CO;2-4, 1998. 

Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.: The ERA5 global reanalysis, Q. J. R. Meteorol. Soc., 146, 1999–2049, https://doi.org/10.1002/qj.3803, 2020. 

Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.: ERA5 hourly data on single levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], https://doi.org/10.24381/cds.adbb2d47, 2023.. 

Hock, R.: Temperature index melt modelling in mountain areas, J. Hydrol., 282, 104–115, https://doi.org/10.1016/S0022-1694(03)00257-9, 2003. 

Japan Meteorological Agency/Japan: JRA-55: Japanese 55-year Reanalysis, Daily 3-Hourly and 6-Hourly Data, Research Data Archive at the National Center for Atmospheric Research, Computational and Information Systems Laboratory [data set], https://doi.org/10.5065/D6HH6H41, 2013. 

Jennings, K. S. and Molotch, N. P.: The sensitivity of modeled snow accumulation and melt to precipitation phase methods across a climatic gradient, Hydrol. Earth Syst. Sci., 23, 3765–3786, https://doi.org/10.5194/hess-23-3765-2019, 2019. 

Kim, R. S., Kumar, S., Vuyovich, C., Houser, P., Lundquist, J., Mudryk, L., Durand, M., Barros, A., Kim, E. J., Forman, B. A., Gutmann, E. D., Wrzesien, M. L., Garnaud, C., Sandells, M., Marshall, H.-P., Cristea, N., Pflug, J. M., Johnston, J., Cao, Y., Mocko, D., and Wang, S.: Snow Ensemble Uncertainty Project (SEUP): quantification of snow water equivalent uncertainty across North America via ensemble land surface modeling, The Cryosphere, 15, 771–791, https://doi.org/10.5194/tc-15-771-2021, 2021. 

Kobayashi, S., Ota, Y., Harada, Y., Ebita, A., Moriya, M., Onoda, H., Onogi, K., Kamahori, H., Kobayashi, C., Endo, H., Miyaoka, K., and Takahashi, K.: The JRA-55 Reanalysis: General Specifications and Basic Characteristics, J. Meteorol. Soc. Jpn. Ser. II, 93, 5–48, https://doi.org/10.2151/jmsj.2015-001, 2015. 

Kosaka, Y., Kobayashi, S., Harada, Y., Kobayashi, C., Naoe, H., Yoshimoto, K., Harada, M., Goto, N., Chiba, J., Miyaoka, K., Sekiguchi, R., Deushi, M., Kamahori, H., Nakaegawa, T., Tanaka, T. Y., Tokuhiro, T., Sato, Y., Matsushita, Y., and Onogi, K.: The JRA-3Q Reanalysis, J. Meteorol. Soc. Jpn., 102, 49–109, https://doi.org/10.2151/jmsj.2024-004, 2024. 

Kuusisto, E.: Snow accumulation and snowmelt in Finland, Vesihallitus – National Board of Waters, Helsinki, 149 pp., Water Research Institute, ISBN 951-46-7494-4, 1984. 

Magnusson, J., Wever, N., Essery, R., Helbig, N., Winstral, A., and Jonas, T.: Evaluating snow models with varying process representations for hydrological applications, Water Resour. Res., 51, 2707–2723, https://doi.org/10.1002/2014WR016498, 2015. 

Menard, C. B., Essery, R., Krinner, G., Arduini, G., Bartlett, P., Boone, A., Brutel-Vuilmet, C., Burke, E., Cuntz, M., Dai, Y., Decharme, B., Dutra, E., Fang, X., Fierz, C., Gusev, Y., Hagemann, S., Haverd, V., Kim, H., Lafaysse, M., Marke, T., Nasonova, O., Nitta, T., Niwano, M., Pomeroy, J., Schädler, G., Semenov, V. A., Smirnova, T., Strasser, U., Swenson, S., Turkov, D., Wever, N., and Yuan, H.: Scientific and Human Errors in a Snow Model Intercomparison, Bull. Am. Meteorol. Soc., 102, E61–E79, https://doi.org/10.1175/BAMS-D-19-0329.1, 2021. 

Mortimer, C., Mudryk, L., Cho, E., Derksen, C., Brady, M., and Vuyvich, C.: Use of multiple reference data sources to cross validate gridded snow water equivalent products over North America, EGUsphere [preprint], https://doi.org/10.5194/egusphere-2023-3013, 2024. 

Mortimer, C., Mudryk, L., Derksen, C., Luojus, K., Brown, R., Kelly, R., and Tedesco, M.: Evaluation of long-term Northern Hemisphere snow water equivalent products, The Cryosphere, 14, 1579–1594, https://doi.org/10.5194/tc-14-1579-2020, 2020. 

Mudryk, L., Elias Chereque, A., Brown, R., Derksen, C., Luojus, K., and Decharme, B.: “Terrestrial Snow” in NOAA Arctic Report Card, https://arctic.noaa.gov/Report-Card/Report-Card-2020 (last access: 31 October 2024), 2020. 

Mudryk, L., Mortimer, C., Derksen, C., Elias Chereque, A., and Kushner, P.: Benchmarking of SWE products based on outcomes of the SnowPEx+ Intercomparison Project, EGUsphere [preprint], https://doi.org/10.5194/egusphere-2023-3014, 2024. 

Mudryk, L. R., Derksen, C., Kushner, P. J., and Brown, R.: Characterization of Northern Hemisphere Snow Water Equivalent Datasets, 1981–2010, J. Clim., 28, 8037–8051, https://doi.org/10.1175/JCLI-D-15-0229.1, 2015. 

Ohmura, A.: Physical Basis for the Temperature-Based Melt-Index Method, J. Appl. Meteorol. Climatol., 40, 753–761, https://doi.org/10.1175/1520-0450(2001)040<0753:PBFTTB>2.0.CO;2, 2001. 

Räisänen, J.: Changes in March mean snow water equivalent since the mid-20th century and the contributing factors in reanalyses and CMIP6 climate models, The Cryosphere, 17, 1913–1934, https://doi.org/10.5194/tc-17-1913-2023, 2023. 

Raleigh, M. S., Lundquist, J. D., and Clark, M. P.: Exploring the impact of forcing error characteristics on physically based snow simulations within a global sensitivity analysis framework, Hydrol. Earth Syst. Sci., 19, 3153–3179, https://doi.org/10.5194/hess-19-3153-2015, 2015. 

Reichle, R. H., Draper, C. S., Liu, Q., Girotto, M., Mahanama, S. P. P., Koster, R. D., and Lannoy, G. J. M. D.: Assessment of MERRA-2 Land Surface Hydrology Estimates, J. Clim., 30, 2937–2960, https://doi.org/10.1175/JCLI-D-16-0720.1, 2017.  

Robinson, D. A.: Evaluation of the collection, archiving and publication of daily snow data in the united states, Phys. Geogr., 10, 120–130, https://doi.org/10.1080/02723646.1989.10642372, 1989. 

Robinson, D. A. and Frei, A.: Seasonal Variability of Northern Hemisphere Snow Extent Using Visible Satellite Data, Prof. Geogr., 52, 307–315, https://doi.org/10.1111/0033-0124.00226, 2000. 

Sospedra-Alfonso, R. and Merryfield, W. J.: Influences of Temperature and Precipitation on Historical and Future Snowpack Variability over the Northern Hemisphere in the Second Generation Canadian Earth System Model, J. Clim., 30, 4633–4656, https://doi.org/10.1175/JCLI-D-16-0612.1, 2017. 

Sturm, M.: White water: Fifty years of snow research in WRR and the outlook for the future, Water Resour. Res., 51, 4948–4965, https://doi.org/10.1002/2015WR017242, 2015. 

Thackeray, C. W., Qu, X., and Hall, A.: Why Do Models Produce Spread in Snow Albedo Feedback?, Geophys. Res. Lett., 45, 6223–6231, https://doi.org/10.1029/2018GL078493, 2018. 

Vionnet, V., Mortimer, C., Brady, M., Arnal, L., and Brown, R.: Canadian historical Snow Water Equivalent dataset (CanSWE, 1928–2020), Earth Syst. Sci. Data, 13, 4603–4619, https://doi.org/10.5194/essd-13-4603-2021, 2021. 

Walter, T. M., Brooks, E. S., McCool, D. K., King, L. G., Molnau, M., and Boll, J.: Process-based snowmelt modeling: does it require more input data than temperature-index modeling?, J. Hydrol., 300, 65–75, https://doi.org/10.1016/j.jhydrol.2004.05.002, 2005. 

WMO (Ed.): Guide to instruments and methods of observation: Volume II – Measurement of Cryospheric Variables, 2018th edn., World Meteorological Organization, Geneva, WMONo., 8, 52 pp., 2018. 

Zhang, Y.-F., Hoar, T. J., Yang, Z.-L., Anderson, J. L., Toure, A. M., and Rodell, M.: Assimilation of MODIS snow cover through the Data Assimilation Research Testbed and the Community Land Model version 4, J. Geophys. Res.-Atmos., 119, 7091–7103, https://doi.org/10.1002/2013JD021329, 2014. 

Download
Short summary
We look at three commonly used snow depth datasets that are produced through a combination of snow modelling and historical measurements (reanalysis). When compared with each other, these datasets have differences that arise for various reasons. We show that a simple snow model can be used to examine these inconsistencies and highlight issues. This method indicates that one of the complex datasets should be excluded from further studies.