Articles | Volume 18, issue 8
https://doi.org/10.5194/tc-18-3495-2024
Research article | 08 Aug 2024

Reanalyzing the spatial representativeness of snow depth at automated monitoring stations using airborne lidar data

Jordan N. Herbert, Mark S. Raleigh, and Eric E. Small
Abstract

Automated snow station networks provide critical hydrologic data. Whether point observations represent snowpack at larger areas is an enduring question. Leveraging the recent proliferation of airborne lidar snow depth data, we revisit the question of snow station representativeness at multiple scales surrounding 111 stations in Colorado and California (USA) from 2021–2023 (n=476 total samples). In about 50 % of cases, station depths were at least 10 cm higher than areal-mean snow depth (from lidar) at 0.5 to 4 km scales. The nearest 50 m lidar pixels had lower bias and were more often representative of the areal-mean snow depth than coincident stations. The closest 3 m lidar pixel often agreed with station snow depth to within 10 cm, suggesting differences between station snow depth and the nearest 50 m lidar pixel result from highly localized conditions and not the measurement method. Representativeness decreased as scale increased up to ∼6 km, mainly explained by the elevation of a site relative to the larger area. Relative values of vegetation and southness did not have significant impacts on site representativeness. The sign of bias at individual snow stations is temporally consistent, suggesting the relationship between station depth and that of the surrounding area may be predictable. Improving understanding of snow station representativeness could allow for more accurate validation of modeled and remotely sensed data.

1 Introduction

Mountain snowpack provides water to over a billion people worldwide (Dozier et al., 2016) and comprises approximately half of freshwater available in the western United States (Li et al., 2017). Snowmelt impacts agricultural activity (Qin et al., 2020) and ecosystems (Blankinship et al., 2014; Dollery et al., 2006) and influences the magnitude and frequency of natural hazards such as wildfires, floods, and droughts (Dierauer et al., 2019; Musselman et al., 2018; Westerling et al., 2006). The amount and timing of water availability in snowmelt-dominated watersheds is dependent on snowpack characteristics. Despite recent advances, existing remote sensing techniques do not allow for spatially and temporally continuous monitoring of snow water equivalent (SWE) in the complex terrain of mountain watersheds (Lettenmaier et al., 2015). Instead, assessments of water stored in mountain snowpack for hydrologic research and applications (e.g., streamflow forecasting) rely on a combination of ground-based snow sampling, remote sensing, and modeling (Pagano et al., 2009).

Automated stations (hereafter: snow stations), such as the Natural Resources Conservation Service's (NRCS) Snow-Telemetry (Snotel) network, provide temporally continuous, high-quality measurements of snow depth and SWE at over 900 locations throughout the western United States. Snow stations are strategically located to maximize their utility for water supply forecasts. Sites with more persistent snow (e.g., higher elevation, northern aspects) are preferred because they provide data for streamflow forecasts longer into the ablation season (NRCS, 2011). Stations are built on flat surfaces, below the treeline (between 2745–3350 m above sea level), and in areas shielded from high winds (Molotch and Bales, 2006; NRCS, 2011; Woelders et al., 2020). The specific requirements for snow station locations, combined with their uneven distribution across the landscape, may increase the potential for bias when using station data to represent larger areas such as an entire watershed.

In addition to aiding water supply forecasts, snow station data are used in a wide array of applications in snow hydrology. Snow station data are frequently used to validate models (Pan et al., 2003; Schneider and Molotch, 2016) and as ground truth references for remotely sensed data (Klein and Barnett, 2003; Lievens et al., 2022; Painter et al., 2016). In these cases, station data are used as the “true” values against which the model and remotely sensed data are validated. However, the data sets being validated frequently represent areas on the 100 m to 1 km scale, much larger than the ∼1–3 m sampling area of a snow station. Another common use for snow station data is as input for data assimilation frameworks (DeChant and Moradkhani, 2011; Margulis et al., 2019; Slater and Clark, 2006; Smyth et al., 2020; Barrett, 2003). These frameworks also use snow station data to represent the (usually much larger) scale of the model resolution. Finally, station data have been spatially interpolated into gridded products (Broxton et al., 2019; Molotch et al., 2005; López-Moreno et al., 2011). Even though the interpolation may include the influence of landscape factors such as elevation or aspect, the representativeness of the snow station data is typically unknown and is thus unaccounted for in the interpolation scheme.

Care is warranted when extrapolating snow station data to larger areas because the distribution of snow across a landscape can be highly variable, especially at 1–100 m scales (Blöschl, 1999; Clark et al., 2011; Scipión et al., 2013). As a result, many studies have assessed the utility of point data to represent larger areas. Evaluations of point measurement representativeness suggest single measurements are inadequate to represent areas as small as 10 m² (López-Moreno et al., 2011) or 30 m² (Fassnacht et al., 2018), and over 50 point measurements are required to represent an area of 300 m² (Watson et al., 2006). Other investigations used manual sampling of snow depth and SWE combined with binary regression trees to determine how snow properties vary surrounding a limited number of snow stations (Meromy et al., 2013; Molotch and Bales, 2005). These results suggested that half or fewer of the stations yielded snow depths within 10 % of the mean snow depth of the surrounding area (areal-mean snow depth). Embedded sensor networks surrounding an operational snow course and snow station demonstrated that neither the snow course nor the station represented the areal-mean snow depth to within 20 %–30 % at the 1, 4, or 16 km² scales due to differences in the surrounding topography (Rice and Bales, 2010).

Other studies have used high-spatial-resolution mapping of snow depth from airborne lidar to assess snow station representativeness, though these efforts were limited in scope. Grünewald and Lehning (2011) used data from five snow stations and three lidar surveys to assess if snow stations can accurately represent the change in snow depth with altitude. Grünewald and Lehning (2015) used lidar surveys from six different watersheds (one survey per watershed), finding sites that met the criteria for snow station locations (as opposed to using real station data) to assess snow station representativeness. These efforts found that snow stations typically overestimate SWE, possibly due to the sampling locations occurring on flat terrain compared to the more characteristically sloping mountainous terrain of the surrounding area. Of the sites that were deemed representative of the surrounding area (within 10 % of the areal mean), there were no discernible similarities in topographic attributes that would serve as a predictor for “well-placed” sites.

The aforementioned studies were limited in the quantity and spatial extent of study areas due to the labor requirements of manually collecting samples and the limited availability of high-resolution lidar snow depth data. The recent proliferation of lidar snow depth data in the western US made possible by the Airborne Snow Observatory (ASO; Painter et al., 2016) provides an opportunity to assess the representativeness of snow monitoring stations using high-confidence, spatially distributed lidar data that are colocated with snow station locations. We utilize lidar snow depth data available in watersheds in Colorado and California to revisit the question of how representative the locations of snow monitoring stations are compared to the surrounding area and whether the relationship is consistent over time.

Here, we address the following questions. (1) How variable is lidar snow depth around operational snow stations? (2) What is the distribution of relative snow depth (RSD; defined in Sect. 2.2.3) values, and how does RSD change when calculated for different spatial scales and point snow depths derived from different sensing techniques (i.e., in situ vs. remotely sensed)? (3) Do individual sites demonstrate repeatable patterns of RSD sign and magnitude over time? Finally, (4) what impact do relative land cover and topography variables (specifically, elevation, fractional vegetation, and southness) have on RSD? While answering these questions we focus on snow depth (not SWE) because snow depth is the variable measured directly both by airborne lidar and at snow stations. See Sect. 2.1.1 for further explanation regarding this decision.

2 Methods

2.1 Study sites and data

We selected locations in Colorado and California that have coincident airborne lidar and snow station data over the interval February 2021 through June 2023. In Colorado, we utilized 40 lidar surveys in 13 watersheds, containing 48 active snow stations, totaling 138 instances of coincident lidar and snow station data. All Colorado lidar surveys were carried out in April and May, typically with two surveys per year per basin. More data were available in California, where we utilized 108 lidar surveys in 13 watersheds, containing 63 active snow stations, totaling 338 coincident lidar–station comparisons. California surveys were conducted between January and June, with most surveys between March and May. Locations of the lidar surveys and snow stations are summarized in Fig. 1 (and Tables S1 and S2 in the Supplement). Between both states, we analyzed 476 instances of coincident lidar–station data.

In the remainder of Sect. 2.1 we provide detailed descriptions of the data sets we employ in this investigation and the scales at which we employ them.

https://tc.copernicus.org/articles/18/3495/2024/tc-18-3495-2024-f01

Figure 1. Locations of lidar surveys and snow stations in (a) California and (b) Colorado, with watersheds labeled. Two stations (Stillwater Creek and Spratt Creek) are highlighted as these are used in subsequent examples.

2.1.1 Snow station data

The NRCS and the California Department of Water Resources (CA-DWR) operate snow stations which monitor snow depth, SWE, and meteorological parameters at select locations in snow-dominated watersheds. These stations collect snow depth data using an ultrasonic sensor (precision: 13 mm) and SWE data by measuring the mass above a snow pillow (precision: 2.5 mm) (NRCS, 2011). Sensor precision values are not reported by CA-DWR but should be similar to the NRCS values since they use similar equipment. The typical spatial support (Blöschl, 1999) is 9 m² for SWE (snow pillow) and ∼1 m² for depth (ultrasonic sensor).

Although SWE is the critical variable for understanding water storage, we conduct our analyses using snow depth because it is the variable directly retrieved by lidar surveys. Lidar SWE products use modeled density (Painter et al., 2016), increasing the uncertainty of the measurement as compared to snow depth. Of the existing literature, one study (Molotch and Bales, 2005) directly measured SWE using a federal sampler to get distributed measurements of SWE but was limited by the total number of samples collected. Most other studies (e.g., Grünewald and Lehning, 2011, 2015; Meromy et al., 2013) converted snow depth to SWE by assuming a uniform snow density across the study site. Snow density is not uniform across the landscape and may contribute considerable uncertainty in SWE estimations based on lidar data (Meehan et al., 2023; Raleigh and Small, 2017; Wetlaufer et al., 2016). Converting values to SWE by assuming a uniform snow density increases the potential error as compared to retaining the values as snow depth. Thus, we keep our analyses in terms of snow depth. Any results herein would be identical if we converted to SWE by multiplying snow depth with a chosen snow density (e.g., Grünewald and Lehning, 2011, 2015; Meromy et al., 2013).

We downloaded daily NRCS Snotel and CA-DWR snow depth data from all sites within the bounds of watershed areas surveyed by ASO with airborne lidar in Colorado and California from 2021 to 2023. We acquired site coordinates (latitudes and longitudes) from the NRCS and CA-DWR websites. Due to the importance of accurate location data for this study, we verified the locations of each snow station using visual inspection of high-resolution satellite imagery in Google Earth. We updated site coordinates in locations where the provided coordinates were visibly offset from an identifiable snow station. The coordinates were updated to the fifth decimal place in decimal degrees, providing ∼1 m accuracy for the location of the center of the snow pillow. We assume that the depth sensor is located over the center of the pillow (which can be identified in the satellite images); however, we recognize that this is not always true. The locations of four CA-DWR sites within lidar-surveyed watersheds could not be verified, and these sites were excluded from the analysis. Site coordinates are available in Tables S1 and S2.

We carried out quality control on the snow depth data to ensure accuracy. NRCS data were free from obvious error, while CA-DWR data frequently displayed unnatural jumps in snow depth. In many cases, the snow depth sensor recorded meter-scale changes in daily snow depth, often followed by a change in the opposite direction of the same magnitude. This likely results from a lack of quality control measures conducted on CA-DWR snow depth data. We discarded clearly erroneous data that recorded unnatural multidirectional shifts of greater than 0.5 m. Upon visual inspection of the data, the 0.5 m threshold removed the unnatural shifts in snow depth.
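The spike screening described above can be illustrated with a minimal sketch. It assumes the simplest case, a single-day jump larger than the threshold that is reversed the next day; the function name `flag_spikes` and the sample series are hypothetical, and the screening actually applied to the CA-DWR data may have differed in detail.

```python
def flag_spikes(depths_m, threshold_m=0.5):
    """Return indices of single-day spikes in a daily snow depth series.

    A day is flagged when the change into it and the change out of it both
    exceed the threshold and have opposite signs (assumed spike definition).
    """
    flagged = []
    for i in range(1, len(depths_m) - 1):
        step_in = depths_m[i] - depths_m[i - 1]
        step_out = depths_m[i + 1] - depths_m[i]
        reverses = step_in * step_out < 0
        if abs(step_in) > threshold_m and abs(step_out) > threshold_m and reverses:
            flagged.append(i)
    return flagged


# Hypothetical daily snow depths (m) with one upward and one downward spike.
series = [1.20, 1.22, 2.45, 1.21, 1.19, 0.40, 1.18, 1.60]
print(flag_spikes(series))  # prints [2, 5]
```

The final 0.42 m increase in the sample series is not flagged because it is below the threshold and is not reversed, so a plausible snowfall event is retained.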

2.1.2 Lidar data

We utilize all ASO lidar snow depth data available in Colorado and California from 2021–2023. These data sets are available as gridded rasters at 50 and 3 m resolutions in the Universal Transverse Mercator (UTM) coordinate system, WGS84. The 3 m product is produced by taking the difference between snow-on and snow-off point clouds, and the 50 m product is an aggregation of the 3 m data (Painter et al., 2016). We use the 50 m data sets to analyze the distribution of snow depth surrounding a snow station and calculate the areal-mean snow depth at a range of larger scales (analyses discussed in Sect. 2.2). The 50 m scale is sufficient to capture snow depth distribution across the landscape at coarser analysis scales and requires much less storage and computational expense to manage compared to the 3 m data sets. We employ a subset of 3 m gridded snow depth data, extracting the pixel coincident with the snow station.
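Extracting the pixel coincident with a station reduces to integer arithmetic on the UTM coordinates. The sketch below assumes a north-up raster described by its upper-left corner and pixel size; the function name `pixel_index` and all coordinates are hypothetical, and the same origin is assumed for both resolutions purely for illustration.

```python
import math


def pixel_index(easting, northing, x_min, y_max, res_m):
    """Row and column of the cell containing a UTM coordinate.

    Assumes a north-up raster whose upper-left corner is (x_min, y_max)
    and whose square cells are res_m metres on a side.
    """
    col = math.floor((easting - x_min) / res_m)
    row = math.floor((y_max - northing) / res_m)
    return row, col


# Hypothetical raster origin and station location (UTM metres).
x_min, y_max = 400000.0, 4500000.0
station = (400512.0, 4499731.0)

print(pixel_index(*station, x_min, y_max, 50.0))  # prints (5, 10)
print(pixel_index(*station, x_min, y_max, 3.0))   # prints (89, 170)
```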

Snow depth is retrieved from lidar data by calculating the difference in surface elevation between snow-on and snow-off surveys. The 3 m snow depths record mean absolute errors of <8 cm, and 50 m snow depths record mean absolute errors of <2 cm (Painter et al., 2016).

It is worth noting that we do not exclude any lidar data based on proximity to human activities (e.g., compacted snow in ski areas, deeper snow due to snow-making, snow removal on roads) which may impact areal-mean snow depths. Snow stations are often built in secluded locations, which we expect to be minimally impacted by human activities, but this is limited to only the small (∼30 m) area surrounding a snow station. Lidar surveys encompassing ski areas, towns, and roads have the potential to record snow depths that do not represent the “natural” snow depth that would have been measured in the absence of human impacts. We chose to not remove any lidar surveys due to the difficulty of finding an objective method to do so and the changing degree of human impact at a site with scale. We found that at least eight snow stations are near ski areas but did not find a consistent bias in the snow depths across those sites.

2.1.3 Land cover and topography data

We obtained digital elevation models and vegetation data sets surrounding all snow stations employed in this study. For the digital elevation model, we use the 10 m resolution USGS National Elevation Dataset (Gesch et al., 2018). These data are used for their elevation values and to calculate southness. Southness serves as a metric for how exposed an area is to solar radiation in the Northern Hemisphere and is calculated as the sine of the slope multiplied by the cosine of the aspect (Dozier and Frew, 1990). For vegetation, we downloaded the National Land Cover Database percent tree cover data set (2019), which provides fractional vegetation (FVEG) at 30 m resolution (Dewitz, 2021). We bilinearly resampled all land cover and topography data to match the 50 m spatial resolution of the lidar data.
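As a worked illustration of the southness formula, the sketch below multiplies the sine of the slope by the cosine of the aspect. The text does not state the aspect convention, so the sketch assumes that aspect is supplied in degrees clockwise from north and rotates it so that 0° corresponds to due south, which makes south-facing slopes positive. The function name `southness` is hypothetical.

```python
import math


def southness(slope_deg, aspect_from_north_deg):
    """sin(slope) * cos(aspect), with aspect re-expressed relative to south.

    Assumption: the input aspect is measured clockwise from north, so it is
    shifted by 180 degrees before taking the cosine.
    """
    aspect_from_south = math.radians(aspect_from_north_deg - 180.0)
    return math.sin(math.radians(slope_deg)) * math.cos(aspect_from_south)


# A 30 degree slope facing south, north, and east.
for aspect in (180.0, 0.0, 90.0):
    print(aspect, round(southness(30.0, aspect), 3))
```

Under this convention a flat cell has a southness of zero regardless of aspect, and east- or west-facing slopes are also close to zero.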

2.1.4 Data representing the snow station

We use different data sources to represent snow depth at the snow station. In doing so, we can establish if any biases result from using data with different spatial coverage and sampling methodologies. These sources include the reported snow station snow depth (station SD), the coincident 50 m resolution lidar pixel (50 m SD), and the coincident 3 m resolution lidar pixel (3 m SD). These data sources have different spatial coverages (1–3 m vs. 50 m) and use different sampling methodologies (in situ vs. lidar). For our analyses we primarily use 50 m SD and station SD; station SD assesses the performance of the station itself while the 50 m SD assesses the general location of the snow station within the landscape.

https://tc.copernicus.org/articles/18/3495/2024/tc-18-3495-2024-f02

Figure 2. The spatial distribution at 50 m resolution of (a) lidar snow depth, (b) elevation, and (c) fractional vegetation. The squares represent spatial scales of 0.5 km (solid), 1 km (dashed), and 4 km (dotted). (d) Cumulative density functions (CDFs) of snow depth at each of the three scales with 50 m SD and station SD plotted on the distribution for the Stillwater Creek snow station in Colorado on 16 April 2023.

https://tc.copernicus.org/articles/18/3495/2024/tc-18-3495-2024-f03

Figure 3. The spatial distribution at 50 m resolution of (a) lidar snow depth, (b) elevation, and (c) fractional vegetation. The squares represent spatial scales of 0.5 km (solid), 1 km (dashed), and 4 km (dotted). (d) Cumulative density functions (CDFs) of snow depth at each of the three scales with 50 m SD and station SD plotted on the distribution for the Spratt Creek snow station in California on 31 March 2023. Note that the x axis in (d) is cut off and that there are snow depth values exceeding 3 m at the 4 km scale.

2.2 Analyses

In this section, we describe the analyses conducted. First, we present the spatial scales at which we conduct the analyses, and we then provide details on each analysis in the order of the research questions it aims to address.

2.2.1 Spatial scales

We conduct our analyses at three spatial scales typically employed in remote sensing and modeling applications: 0.5 km × 0.5 km, 1 km × 1 km, and 4 km × 4 km grid squares (hereafter: 0.5, 1, and 4 km scales) (Figs. 2, 3). The snow stations were centered within these squares (as in previous studies); however, we acknowledge that snow stations will rarely be centered in gridded products (remote sensing or distributed models). We separately repeated the same analyses using the 0.5 km MOD10A1F grid from the MODIS/Terra Snow Cover Daily L3 Global 500 m SIN Grid data set (Riggs and Hall, 2020), and the results (not shown) were not significantly changed as compared to the 0.5 km grid centered around a snow station.

We also expand on the three discrete scales to more directly assess how representativeness and the influences of land cover and topography change with scale. Beginning at the point scale, we expand outward in 50 m increments up to the 8 km scale. In doing so, we are able to assess the relationship of scale and representativeness as well as determine if the trends we observe continue beyond the 4 km scale. At some sites, expanding the analysis to scales greater than 4 km results in an analysis area that extends beyond the bounds of the lidar scan. For the expanded scale analysis, we only included sites in which 90 % or more of the grid cells contain snow depth values at the 8 km scale. This reduced the number of snow stations in the analysis to 56 (from 111) but ensured that the results were not influenced by increased amounts of null data at larger scales.
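The windowing and the 90 % coverage criterion can be sketched as follows. This is an illustrative implementation under stated assumptions, not the code used in the study: the grid is a list of rows in local coordinates with `None` marking cells without lidar data, a cell belongs to the window when its center lies within half the scale of the station (half-open interval, so every window has the same nominal cell count), and the name `window_stats` is hypothetical.

```python
def window_stats(grid, res_m, station_x, station_y, scale_m):
    """Areal-mean depth and valid-cell fraction for a station-centred window.

    grid[row][col] holds snow depth (m) or None; cell (row, col) is assumed
    to be centred at ((col + 0.5) * res_m, (row + 0.5) * res_m).
    """
    half = scale_m / 2.0
    expected = round(scale_m / res_m) ** 2  # nominal number of cells in the window
    valid = []
    for r, row in enumerate(grid):
        dy = (r + 0.5) * res_m - station_y
        if not (-half <= dy < half):
            continue
        for c, depth in enumerate(row):
            dx = (c + 0.5) * res_m - station_x
            if -half <= dx < half and depth is not None:
                valid.append(depth)
    mean = sum(valid) / len(valid) if valid else None
    return mean, len(valid) / expected


# Hypothetical 6 x 6 grid of 50 m cells with one deep cell and one gap.
grid = [[1.0] * 6 for _ in range(6)]
grid[1][1] = 2.5
grid[2][2] = None

mean, frac = window_stats(grid, 50.0, 150.0, 150.0, 200.0)
print(round(mean, 3), frac, frac >= 0.9)

# A station near the grid edge: part of the window falls outside the scan.
_, edge_frac = window_stats(grid, 50.0, 50.0, 150.0, 200.0)
print(edge_frac, edge_frac >= 0.9)
```

Dividing by the nominal cell count rather than by the number of cells found in the grid means that windows extending past the lidar scan are penalized in the same way as windows containing null data.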

2.2.2 Snow depth variability

To gauge snow depth variability surrounding a snow station we evaluate the distribution of snow depths at each scale. To do so, we calculate the 5th–95th percentile range of snow depth values using the 50 m resolution lidar data at each coincident lidar–station pair (Figs. 2d, 3d). We then determine where point snow depth observations (station SD and 50 m SD) fall within the cumulative density function (CDF) of 50 m snow depths at each scale. We present the results of this analysis in Sect. 3.1.
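A minimal sketch of the two quantities used in this analysis follows: the 5th–95th percentile range of the 50 m depths in a window, and the position of a point depth within their empirical CDF. Linear interpolation between ranked values is an assumption (the percentile method is not specified in the text), and the function names and depths are hypothetical.

```python
import bisect


def percentile(sorted_vals, q):
    """Linearly interpolated percentile (q in 0-100) of a pre-sorted list."""
    pos = (len(sorted_vals) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * (pos - lo)


def cdf_position(sorted_vals, point_depth):
    """Fraction of cells whose depth is less than or equal to the point depth."""
    return bisect.bisect_right(sorted_vals, point_depth) / len(sorted_vals)


# Hypothetical 50 m snow depths (m) in a window: 1.00, 1.05, ..., 2.00.
depths = sorted(i / 100 for i in range(100, 201, 5))

p5_p95_range = percentile(depths, 95) - percentile(depths, 5)
print(round(p5_p95_range, 2))
print(round(cdf_position(depths, 1.62), 3))  # position of a point depth of 1.62 m
```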

2.2.3 Relative snow depth and representativeness

We assess the spatial representativeness of a snow station by comparing point snow depth to the areal-mean snow depth. To do so, we employ relative snow depth (RSD). RSD is calculated by subtracting the areal-mean snow depth from the point snow depth representing the snow station, following Eq. (1):

RSD = point snow depth − areal-mean snow depth    (1)

We use RSD to determine if extrapolation of the point snow depth to the larger area would overestimate (if positive) or underestimate (if negative) the areal-mean snow depth. We calculate the RSD for each spatial scale, using station SD and 50 m SD as point data sources. We deem a site to be representative if the RSD is within ±10 cm. We acknowledge that the range of “representative” RSD values varies based on the application and that there is subjectivity in what constitutes a representative site (similarly discussed in Meromy et al., 2013). Our results could easily be adjusted using a different range of acceptable values. We present a probability density function in Sect. 3.2 to illustrate the distribution of RSD values irrespective of our classification of representativeness. Unlike previous investigations, we do not use a percent difference from the mean as an indicator of representativeness, as percentages can be overly influenced by the magnitude of snow depth. The data we employ encompass a wide variety of locations and times within the snow season, meaning snow depth magnitudes are highly variable. As such, the magnitude difference is a more interpretable metric.
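Eq. (1) and the ±10 cm classification can be written out in a short sketch. The function names and sample depths are hypothetical, and the RMSE is assumed here to be the root mean square of the RSD values themselves.

```python
import math
import statistics

THRESHOLD_M = 0.10  # +/-10 cm representativeness range used in the text


def rsd(point_depth_m, areal_mean_m):
    """Relative snow depth, Eq. (1): point depth minus areal-mean depth."""
    return point_depth_m - areal_mean_m


def classify(rsd_m, threshold_m=THRESHOLD_M):
    """Label an RSD value as low-biased, representative ('in'), or high-biased."""
    if rsd_m < -threshold_m:
        return "low"
    if rsd_m > threshold_m:
        return "high"
    return "in"


def summarize(rsd_values):
    """Percentages per class plus median, mean, and RMSE of the RSD values."""
    n = len(rsd_values)
    labels = [classify(v) for v in rsd_values]
    return {
        "low_pct": 100.0 * labels.count("low") / n,
        "in_pct": 100.0 * labels.count("in") / n,
        "high_pct": 100.0 * labels.count("high") / n,
        "median": statistics.median(rsd_values),
        "mean": statistics.fmean(rsd_values),
        "rmse": math.sqrt(sum(v * v for v in rsd_values) / n),
    }


# Hypothetical (point depth, areal-mean depth) pairs in metres.
pairs = [(1.50, 1.20), (0.80, 0.85), (0.60, 0.90), (2.00, 1.60)]
values = [rsd(p, a) for p, a in pairs]
print([classify(v) for v in values])  # prints ['high', 'in', 'low', 'high']
print(summarize(values))
```

Changing `THRESHOLD_M` is all that is needed to explore a different range of acceptable values.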

Snow stations are strategically placed on the landscape to maximize their utility for water supply forecasts (NRCS, 2011). We assess the impact of this strategic placement by calculating RSD for all possible snow station locations at each study site. Using lidar data, we calculate the RSD value by sequentially setting each pixel in a study area as the snow station location. For example, we calculate 100 RSD values at the 0.5 km scale for the 100 pixels (each 50 m resolution) within the study area. We use these data to create a distribution of expected RSD values at a given scale (termed virtual RSD). We then compare the distribution of the virtual RSD values to the distribution of real RSD values (across all 476 lidar–station survey pairings) to see how strategic placement of snow stations compares to expected RSD values. The results of these analyses are presented in Sect. 3.2.
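One reading of the virtual RSD calculation is that every valid pixel in a study area is compared against that area's mean. The sketch below follows that interpretation, which is an assumption; the function name `virtual_rsd` and the small grid are hypothetical.

```python
def virtual_rsd(grid):
    """RSD of every valid pixel relative to the mean of the same study area.

    grid[row][col] holds snow depth (m) or None for cells without data.
    """
    valid = [depth for row in grid for depth in row if depth is not None]
    areal_mean = sum(valid) / len(valid)
    return [depth - areal_mean for depth in valid]


# Hypothetical study area (m); a real 0.5 km window would hold 100 pixels.
grid = [
    [1.0, 1.2, None],
    [0.8, 1.4, 1.1],
]
print([round(v, 2) for v in virtual_rsd(grid)])
```

By construction the virtual RSD values of one study area average to zero, so any asymmetry in their distribution reflects skewness in the snow depth distribution rather than a mean offset.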

2.2.4 Consistency of RSD values

Are the sign and magnitude of RSD at a site consistent through time? We address this question by calculating RSD at each snow station over all available lidar surveys in the 3-year period. For this temporal consistency assessment, we include all sites that have data points spanning at least three lidar surveys across at least 2 years (n=71 sites). To assess temporal consistency at snow stations, we partition the data into three groups: those where the median RSD is less than −0.1 m, between −0.1 and 0.1 m, and greater than 0.1 m. We then analyze the distribution of RSD values within these three groups. Additionally, we assess how RSD varies throughout the season by plotting RSD against days to snow station meltout date for each site.
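The site selection and grouping can be sketched as below. The record layout (a list of year and RSD pairs per site), the function names, and the site data are hypothetical and serve only to illustrate the eligibility rule and the three median-based groups.

```python
import statistics


def eligible(records):
    """True if a site has at least three surveys spanning at least 2 years.

    records is a list of (year, rsd_m) tuples for one site.
    """
    years = {year for year, _ in records}
    return len(records) >= 3 and len(years) >= 2


def median_group(records, threshold_m=0.10):
    """Assign a site to the low, in, or high group from its median RSD."""
    median_rsd = statistics.median([value for _, value in records])
    if median_rsd < -threshold_m:
        return "low"
    if median_rsd > threshold_m:
        return "high"
    return "in"


# Hypothetical sites with RSD values (m) from individual lidar surveys.
sites = {
    "site_a": [(2021, 0.25), (2022, 0.31), (2023, 0.18)],
    "site_b": [(2022, -0.02), (2022, 0.05), (2023, 0.12)],
    "site_c": [(2023, -0.30), (2023, -0.20)],  # too few surveys, single year
}
groups = {name: median_group(recs) for name, recs in sites.items() if eligible(recs)}
print(groups)  # prints {'site_a': 'high', 'site_b': 'in'}
```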

2.2.5 Land cover and topography analysis

We assess variations in land cover and topography to test whether there are any discernible effects on RSD (Figs. 2b, c, 3b, c). To do so, we calculated relative elevation, relative fractional vegetation (FVEG), and relative southness. These metrics are similar to RSD; they are calculated by subtracting the areal-mean value of the variable from the pixel value closest to the snow station. For example, a positive relative fractional vegetation value signifies that the fractional vegetation value representing the snow station is greater than the mean fractional vegetation of the surrounding area. We use linear regressions to determine if there are significant relationships between the relative land cover or topography variables and RSD.
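The relative variables and the regression step can be illustrated with an ordinary least-squares fit. This sketch uses made-up numbers, hypothetical function names, and reports only the slope, intercept, and correlation coefficient; the significance testing mentioned in the text is not reproduced here.

```python
import math


def relative_value(point_value, areal_values):
    """Pixel value nearest the station minus the areal mean of the variable."""
    return point_value - sum(areal_values) / len(areal_values)


def linear_fit(x, y):
    """Ordinary least-squares slope, intercept, and Pearson r for y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r


# Relative elevation for one hypothetical site (m).
print(relative_value(3100.0, [2900.0, 3000.0, 3100.0, 3000.0]))  # prints 100.0

# Hypothetical relative elevations (m) and RSD values (m) across sites.
rel_elev = [-200.0, -100.0, 0.0, 100.0, 200.0]
rsd_m = [-0.30, -0.10, 0.05, 0.20, 0.35]
slope, intercept, r = linear_fit(rel_elev, rsd_m)
print(round(slope, 4), round(intercept, 2), round(r, 3))
```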

3 Results

3.1 Snow depth variability

The spatial variability of snow depth influences the likelihood that a snow station is representative of the surrounding area. A higher range of snow depths increases the maximum possible magnitude of RSD, whereas a limited snow depth range has a smaller maximum RSD. For example, a site with a 20 cm range of snow depths would have a maximum RSD value of 10 cm (assuming a normal distribution), guaranteeing the station to be representative. Recall that we define a representative site as being within 10 cm of the areal mean. Here, we examine the statistical distribution of snow depth surrounding snow stations and its role in site representativeness, with a focus on the 0.5 km scale.

The 5th–95th percentile range of snow depth varies greatly between sites and between study regions (Colorado vs. California, Fig. 4a, f). The mode for the 5th–95th percentile range is 0.4–0.5 m in Colorado and 0–0.1 m in California; the latter is a result of lidar surveys occurring when some study sites were mostly snow-free. Aside from these low values, most sites have a range of snow depths of 0.3–0.6 m at the 0.5 km scale in both Colorado and California. The maximum 5th–95th percentile range is about 1 m in Colorado and 2.4 m in California, likely due to deeper snowpacks in California. The median range is 0.46 m in Colorado and 0.61 m in California.

The CDF plots demonstrate a range of possible scenarios created from different snow depth distributions. Sites characterized by lower snow depth variability (Fig. 4b, c, g, h) are less likely to have point snow depths far from the median due to the limited range of snow depths, while sites with higher snow depth variability (Fig. 4d, e, i, j) allow for greater differences between the median and point snow depth. For example, at the Michigan Creek Snotel site (Fig. 4b) the 50 m SD and station SD values correspond to the 7th and 95th percentiles, yet both values are within 0.1 m depth of the median value. Conversely, at sites with greater snow depth variability (e.g., Scotch Creek and Huysink; Fig. 4e, j), high percentiles corresponding with the station SD are accompanied by large differences from the median (0.46 and 0.95 m, respectively). These results highlight that snow depth variability differs from site to site and that a point's percentile distance from the median is influenced by the range of snow depth values. Thus, percentile proximity to the median is not an effective indicator of representativeness at sites with low or moderate snow depth variability. Snow depth variability is therefore one important factor controlling the likelihood that a site will be representative of the surrounding area, since sites with low variability are more likely to yield depths close to the station SD.

https://tc.copernicus.org/articles/18/3495/2024/tc-18-3495-2024-f04

Figure 4. Histogram plot of the 5th–95th percentile lidar snow depth values around snow stations in (a) Colorado (138 sites) and (f) California (338 sites). Cumulative density function plots at select sites in (b–e) Colorado and (g–j) California spanning low to high snow depth variability at the 0.5 km scale. Point snow depths are plotted with their corresponding probabilities within the snow depth distribution in blue for station SD and yellow for 50 m SD. Vertical black lines represent the range of snow depth values which are within ±10 cm of the median snow depth.

3.2 Site representativeness

We now examine the distribution of RSD values and how the distribution changes when RSD is calculated using different scales and point snow depths. This is compared to the distribution of virtual RSDs, which represent the distribution of RSDs calculated when considering each lidar pixel in the study area to be a hypothetical station location. The virtual RSD distribution shows the RSDs that would be expected if a snow station were randomly placed within the landscape.

When using station SD as the point measurement, 35 %, 33 %, and 28 % of the snow stations are representative at the 0.5, 1, and 4 km scales, respectively (Fig. 5, Table 1). Root-mean-square error (RMSE) is 0.46, 0.48, and 0.54 m for the same respective scales. Approximately 50 % of RSD values are biased high (RSD > 0.1 m), while only ∼15 %–21 % are biased low (RSD < −0.1 m) at all three scales.

https://tc.copernicus.org/articles/18/3495/2024/tc-18-3495-2024-f05

Figure 5. (a–c) Probability density functions of RSD at the 0.5, 1, and 4 km scales, using 50 m SD, station SD, and the virtual station locations as point values for all sites. (d–f) The relative distribution of RSD values that are less than −10 cm (low), within 10 cm (in), or above 10 cm (high) for each of the point values at each scale. The vertical grey lines at −0.1 and 0.1 m represent the delineations between low-biased, representative, and high-biased sites.

Table 1. The percentage of coincident lidar–snow station data points where RSD is less than −10 cm (Low), within ±10 cm (In), or above 10 cm (High) for each scale, using the 50 m lidar, station SD, and virtually placed snow stations. Median, mean, and RMSE of the RSD values are also presented.

Sites are more frequently representative when using 50 m SD to represent the station (as compared to station SD). Approximately 50 % of points are representative at the 0.5 and 1 km scales. Representativeness again decreases with scale, with 38 % of points being representative at the 4 km scale (Fig. 5, Table 1). Relative to the station SD case, RMSE values are lower when using 50 m SD, yielding values of 0.20, 0.24, and 0.35 m for the 0.5, 1, and 4 km scales (Table 1), respectively. At all three scales the proportion of high-biased sites is greater than the proportion of low-biased sites. However, the difference between high- and low-biased sites is less pronounced when using 50 m SD vs. station SD.

The virtual snow station analysis suggests that 50 m SD locations more effectively represent the surrounding area than if they were placed randomly (Fig. 5 and Table 1). Compared to virtual locations, real site placement (using 50 m SD) increases the frequency of representative sites and reduces the frequency of low-biased sites at all three scales. The frequency of high-biased sites is approximately equal between the 50 m SD and virtual site placement values at all three scales. We compare the 50 m SD and virtual stations to each other because they are generated from the same data set. In doing so, the comparisons we make are a direct reflection of the location within the study area, and not any biases in sampling methodology or spatial coverage. It is important to note that both the 50 m SD and virtual stations perform better than the station SD. We analyze the reason for decreased representativeness when using station SD in Sect. 3.3.

Next, we expand the spatial scales of our analysis at 50 m increments from 0.1 to 8 km scales to more fully examine the effect of scale on representativeness. For both the 50 m SD and station SD the proportion of representative sites decreases with scale, plateauing at a minimum value near 20 % at the ∼6 km scale (Fig. 6). The main differences between the 50 m SD and station SD results are that at the smaller scales (0.1 to 1 km) the 50 m SD values have higher proportions of representativeness, and the high bias for the station SD RSD values is consistently near 50 % regardless of scale.

These results highlight that (1) point snow depths are more likely to be representative of the surrounding area at finer scales than at coarser scales, (2) non-representative sites are more likely to be biased high than biased low at all three scales and for all data sources, and (3) high biases are most pronounced when using station SD.

Figure 6. The percentage of low-biased, representative, or high-biased RSD values for each scale from 0.1 to 8 km when using (a) 50 m SD as the point value or (b) station SD as the point value.

3.3 Point snow depth comparisons

As exemplified in Fig. 5, the source and spatial coverage of point snow depth observations influence whether a site qualifies as representative. RSD calculated using station SD tends to have a higher bias than RSD calculated using 50 m SD (Fig. 5). There are two possible explanations for this bias: (1) snow stations tend to be installed in locations with relatively deep snow compared to the surrounding 50 m area, or (2) there is a systematic bias caused by the difference between remotely sensed lidar and in situ station ultrasonic measurements of snow depth. To assess the cause of these differences we now compare the 50 m SD, the 3 m SD, and the station SD with each other (Fig. 7).

Station SDs are systematically higher than the 50 m SDs, with 48 % of station SDs being over 10 cm greater than their 50 m SD counterparts and only 9 % being at least 10 cm less than the 50 m SD (Fig. 7a). The station SD and 3 m SD match each other more closely (Fig. 7b); 64 % of points are within ±10 cm of each other, with minimal bias. The 3 m SD to 50 m SD comparison (Fig. 7c) yields similar results to those of the snow station SD to 50 m SD comparison (Fig. 7a), with a similar high bias. The similarity between the 3 m SD and station SD values suggests that the high bias in RSD at stations is not caused by differences in measurement technique (i.e., airborne lidar vs. a ground-based ultrasonic sensor). Thus, we conclude that the high bias reported by the station SD and 3 m SD is a result of differences in snow depth at the station locations compared to the surrounding 50 m area.

Figure 7. Scatter plots comparing the three different options for point snow depth: (a) station SD vs. 50 m SD, (b) station SD vs. 3 m SD, and (c) 3 m SD vs. 50 m SD. Points inside the black lines are within ±10 cm of each other. Histogram insets represent percentage of points that are below, within, or above the ±10 cm threshold represented by the black lines.

3.4 Temporal consistency of RSD at snow stations

RSD values at individual sites demonstrate temporal consistency from survey to survey at all three scales (Fig. 8). For this analysis, we used sites with three or more lidar surveys. We grouped the sites into three categories: those with median RSD values less than −0.1 m (low bias), between −0.1 and 0.1 m (unbiased), or greater than 0.1 m (high bias) at the 0.5, 1, and 4 km scales (Fig. 8a–c). Violin plots of the three categories (Fig. 8d–f) illustrate a divide between the three groups. Sites in the low-biased group are characterized by almost exclusively negative RSD values, whereas sites in the high-biased group are characterized by almost exclusively positive RSD values. For example, at the 0.5 km scale, 64 of 65 RSD values in the low-biased group are less than or equal to zero. Similarly, 83 of 90 RSD values in the high-biased group are greater than or equal to zero. The proportions of low and high sites are similar at the 1 and 4 km scales. These results demonstrate that certain sites exhibit consistency in the sign of RSD values through time.
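
The grouping and sign-consistency tally described above can be sketched as follows. This is a hedged illustration, not the authors' implementation: the function names, the dictionary layout, and the site values are hypothetical, and only the thresholds (median RSD of ±0.1 m, at least three surveys) come from the text.

```python
from statistics import median


def group_site(rsd_by_survey_m, threshold_m=0.10, min_surveys=3):
    """Assign a site to 'low', 'unbiased', or 'high' by its median RSD (m).

    Returns None when the site has fewer than min_surveys lidar surveys.
    """
    if len(rsd_by_survey_m) < min_surveys:
        return None
    med = median(rsd_by_survey_m)
    if med < -threshold_m:
        return "low"
    if med > threshold_m:
        return "high"
    return "unbiased"


def sign_consistency(sites):
    """Count RSD values whose sign matches the group of their site.

    sites: dict mapping a site name to its list of per-survey RSD values (m).
    Returns {group: (n_consistent, n_total)} for the low and high groups.
    """
    tallies = {"low": [0, 0], "high": [0, 0]}
    for values in sites.values():
        group = group_site(values)
        if group not in tallies:  # skips unbiased sites and sites with < 3 surveys
            continue
        for value in values:
            tallies[group][1] += 1
            if (group == "low" and value <= 0) or (group == "high" and value >= 0):
                tallies[group][0] += 1
    return {group: tuple(t) for group, t in tallies.items()}


# Hypothetical sites, not data from the study.
sites = {
    "A": [-0.30, -0.20, 0.05],
    "B": [0.20, 0.40, 0.30, -0.01],
    "C": [0.00, 0.05, -0.02],
    "D": [0.50, 0.60],  # excluded: fewer than three surveys
}
result = sign_consistency(sites)
```

A tally such as 64 of 65 for the low-biased group corresponds to the first element of the returned pair divided by the second.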

The temporal consistency of RSD at a site must be influenced by more than just relative elevation. As shown in Sect. 3.5, the magnitude of RSD values increases in tandem with the increased magnitude of relative elevation values. However, there is still a clear temporal consistency in the sign of RSD at the smaller (0.5 and 1 km) scales, where relative elevation has minimal influence (Fig. 8a, b). The 0.5 km scale is particularly striking; relative elevation magnitudes are generally less than 25 m (Fig. 8a), but there is still a clear delineation of low-biased and high-biased sites (Fig. 8d, f). The 4 km scale does exhibit an increased number of low- and high-biased sites and higher-magnitude RSD values, which may be a result of higher-magnitude relative elevation values.

Figure 8. The temporal consistency of relative snow depth at snow stations with three or more lidar surveys. (a–c) Each snow station (x axis) plotted against the RSD calculated from the 50 m SD for each lidar survey at 0.5, 1, and 4 km scales. Crosses represent individual RSD values, and the lines represent the range of RSD values at a given site. Stations are ordered from lowest to highest mean RSD for each scale (snow stations are thus in different orders for each scale). Relative elevation values are also plotted as black circles on the right y axis. (d–f) Distribution plots of qualitatively grouped snow stations that are typically biased low, unbiased, or biased high for the three scales. The black bars with circles represent the median and interquartile range of the RSD values.

The above paragraphs analyzed trends of RSD at a site regardless of timing. Here, we assess how RSD varies throughout the season. Figure 9 displays relative snow depth in relation to days from snow station meltout for three selected sites at all three spatial scales. We selected sites that yield typically negative (Devil's Postpile), variable (Dana Meadows), or positive (Ostrander Lake) RSD values. These data demonstrate that RSD does change within the snow season. At Devil's Postpile and Ostrander Lake, RSD magnitudes reach their peak in the ablation season, approximately 50–25 d from meltout. Dana Meadows is less consistent in the timing of the maximum magnitude of RSD, with maximums in 2021 and 2022 occurring in the late ablation season, while the 2023 maximum occurred near peak snow depth. These data also suggest that scale influences the magnitude of RSD, but the sign and trend are consistent between all scales. We display three sites from California because California sites have more lidar surveys and surveys that span a greater breadth of the snow season. Colorado sites display similar trends to the sites shown in Fig. 9.

Figure 9. (a–c) Days to meltout vs. relative snow depth for all lidar surveys at select sites at all three scales. (d–f) Snow depth time series as recorded by snow stations for years with coincident lidar–station data at the selected site. Note that 2021 data are missing at Devil's Postpile.

3.5 Topography and fractional vegetation

In this section we examine question 4: what impact do relative land cover and topography variables have on RSD? We found significant correlations between relative elevation and RSD (calculated using 50 m SD) but no significant correlations between relative fractional vegetation or relative southness and RSD. However, regressions of fractional vegetation and southness against snow depth at each site at the 4 km scale (i.e., a regression of all 50 m lidar snow depth values against the coincident fractional vegetation or southness values at a site) demonstrated significant relationships (p<0.05) at 86 % and 93 % of sites for fractional vegetation and southness, respectively (results not shown). These results indicate that fractional vegetation and southness impact snow depth; however, the relative variables do not have significant correlations with relative snow depth. We discuss possible reasons for this in Sect. 4.3. We focus on relationships between RSD and relative elevation hereafter and include results related to relative fractional vegetation and relative southness in the Supplement.

Analysis of the three primary scales demonstrates that the correlation (as indicated by R²) between RSD and relative elevation increases with scale (Fig. 10). At the 4 km scale, the slope of the linear regression indicates that RSD increases by 16 cm for every 100 m of relative elevation (R² = 0.3). The positive slope is consistent with our expectation of lapse rates of temperature and precipitation producing deeper snow at higher elevations.
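
The slope and coefficient of determination quoted above come from an ordinary least-squares fit of RSD against relative elevation. A minimal sketch of such a fit is given below, using only the Python standard library. The function name and the sample points are hypothetical; the sample is constructed to lie exactly on a line of 16 cm per 100 m and is not data from the study.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept, with R squared."""
    x_mean, y_mean = mean(x), mean(y)
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    syy = sum((yi - y_mean) ** 2 for yi in y)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    r_squared = (sxy * sxy) / (sxx * syy)  # squared Pearson correlation
    return slope, intercept, r_squared


# Hypothetical sites: relative elevation (m) and RSD (m).
relative_elevation_m = [-100.0, 0.0, 100.0, 200.0]
rsd_m = [-0.16, 0.0, 0.16, 0.32]

slope, intercept, r_squared = fit_line(relative_elevation_m, rsd_m)
rsd_change_per_100m_cm = slope * 100.0 * 100.0  # m per m -> cm per 100 m
```

With real data the points scatter around the line, so R² falls well below 1 (0.3 at the 4 km scale in this study).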

The expanded scale analysis (0.1 to 8 km scales) allows us to better understand the interplay of scale and elevation effects on RSD. As discussed in Sect. 2.2, we only include sites in which 90 % or more of the grid cells contain valid snow depth values at the 8 km scale. The correlation between RSD and relative elevation (as indicated by R²) steadily increases with scale until ∼7 km, where it levels off at a value of ∼0.47 (Fig. 11a). The relationship between RSD and relative elevation is significant (p<0.05) at scales greater than or equal to 0.5 km (Fig. 11a).

Figure 10. Scatter plots showing the relationship between relative elevation and relative snow depth at the three spatial scales using 50 m SD data to represent the point value.

Figure 11. (a) Spatial scale vs. R² correlation between RSD and relative elevation. Points with p values less than 0.05 are marked with a filled circle, while points with p values greater than 0.05 are marked with an “x” marker. (b) Scale vs. the mean range of elevations calculated from all sites.

4 Discussion

4.1 High-bias tendency at operational snow stations

We found that station SDs exceeded the areal-mean snow depth by at least 10 cm in ∼50 % of cases at all scales (Figs. 5, 6). Longer persisting snow at snow stations is beneficial for water supply forecasts, but it is unclear whether this bias is by design. The finding that snow stations are biased high compared to the areal-mean snow depth is not unprecedented. Grünewald and Lehning (2011, 2015) found that snow stations typically overestimate the mean snow depth of both the corresponding elevation band and the entire catchment when analyzing snow depth surrounding areas that fit the qualifications for a snow station location. Meromy et al. (2013) analyzed 53 samples, designating a site as representative if the station SD was within ±10 % of the areal-mean SD. Using that definition, 51 % of their station SDs were representative, 30 % were high, and 19 % were low at the 1 km scale. This distribution more closely matches the distribution we observed when using 50 m SD as the point snow depth but still demonstrates a slight high bias. It is important to note that their threshold is a percentage of the areal-mean snow depth, whereas ours is an absolute difference from the areal-mean snow depth, which could affect the comparison.
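
The difference between a percentage-based and an absolute representativeness threshold can be made concrete with a short sketch. The two functions below are hypothetical illustrations of the ±10 % and ±10 cm definitions discussed above, and the snow depths are invented values chosen to show where the two definitions disagree.

```python
def representative_abs(point_sd_m, areal_mean_m, tol_m=0.10):
    """Absolute definition: point within +/-tol_m of the areal mean."""
    return abs(point_sd_m - areal_mean_m) <= tol_m


def representative_pct(point_sd_m, areal_mean_m, tol_frac=0.10):
    """Percentage definition: point within +/-tol_frac of the areal mean."""
    return abs(point_sd_m - areal_mean_m) <= tol_frac * areal_mean_m


# Shallow snowpack: a 5 cm difference passes the absolute test but is about 17 % of the mean.
shallow = (0.35, 0.30)
# Deep snowpack: a 15 cm difference fails the absolute test but is only 7.5 % of the mean.
deep = (2.15, 2.00)

for point_sd, areal_mean in (shallow, deep):
    print(
        point_sd,
        areal_mean,
        representative_abs(point_sd, areal_mean),
        representative_pct(point_sd, areal_mean),
    )
```

The percentage definition is therefore stricter for shallow snow and more lenient for deep snow, so class frequencies from the two approaches are not directly interchangeable.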

Comparing the snow depths we use to represent the snow stations demonstrates that the station SD values are consistently higher than the 50 m SD values (Fig. 7). The general agreement between the 3 m SD and station SD values, two independent data sources, suggests that the deeper snow depths at the snow stations are not a result of differences in sampling methodology (i.e., lidar vs. ultrasonic depth sensor) but rather fine-scale (several meters) spatial variability within the 50 m pixel. A higher proportion of sites are representative of larger areas when using 50 m SD as opposed to station SD (Fig. 5). This suggests that the high bias at the fine-scale station location lowers representativeness. Uniformly correcting the bias exhibited by snow station snow depths would mitigate this problem at some sites but risks deteriorating representativeness at low-biased sites. Thus, bias correction would have to be site specific and require existing spatial snow depth data.

Why are station SDs higher than the corresponding 50 m SD values? There are two broad possibilities for why station location within a 50 m pixel causes a high bias: either (1) there is a persistent bias caused by snow station location or (2) the bias is caused by the snow station infrastructure. Grünewald and Lehning (2011) suggested that deeper snow at stations compared to the surrounding area was a result of flat terrain at a snow station compared to the sloping terrain characteristic of a mountain watershed. Persistent shielding effects or placement within forest gaps could provide another location-based explanation for the high bias. The bias could also be introduced by the snow pillow, which is a flat, vegetation-free structure with thermal properties distinct from the surrounding forest floor. A final explanation could be that snow density is systematically lower at the snow station, meaning that the increased SD would not actually result in differences in SWE. Density could be lower due to altered thermal exchange at the snow–ground interface caused by the snow pillow (i.e., changing metamorphism) or due to wind sheltering (e.g., reduced rates of settlement and compaction of newer snow). This final issue highlights the limitations of working in terms of snow depth, since spatial variations in density can influence snow depth variations (e.g., Bonnell et al., 2023; Meehan et al., 2023). Knowledge of both depth and density is needed to accurately resolve spatial distributions of SWE. In all, further work is required to ascertain the exact cause of higher snow depths recorded at snow stations compared to the surrounding 50 m area.

4.2 Temporal consistency of station biases

Snow stations exhibit both intra- and inter-annual consistency in the directional bias of RSD. At least half of sites with three or more lidar surveys demonstrate almost exclusively unidirectional bias in RSD at all three scales (Fig. 8). Meromy et al. (2013) also found consistent bias direction and magnitude at many sites in their investigation. Another study analyzing basin-wide snowpack using lidar data found consistent patterns of snowpack in years with similar meteorological characteristics (Pflug and Lundquist, 2020). Topography, land cover, and typical storm tracks are relatively static on annual timescales (e.g., Liston, 1999). If these are the factors that control snow depth distribution, it is not unexpected that RSD biases would also be similar from year to year at a given site.

Given this consistency, it may only take a few lidar surveys at a site to determine the relationship of a snow station to the surrounding area at a certain scale. However, the timing of lidar surveys within the snow season would need to be considered since the magnitude of RSD varies throughout the season (Fig. 9). Lidar survey timing is currently biased towards peak SWE and the ablation season, with limited surveys during the accumulation season. Regardless, previous efforts to determine the relationship between a snow station and the surrounding area required labor-intensive manual sampling of snow depth surrounding a snow station. Thus, we can increase the utility of the temporally continuous snow station data with just a few lidar surveys. The consistency we observe provides the opportunity to adjust snow station data based on the typical RSD bias at a site for other applications. Doing so would cause the adjusted value to be more in line with the areal-mean snow depth, improving its utility for remote sensing ground truthing, data assimilation, or model validation efforts.

4.3 Influence of land cover and topography

Vegetation and topography influence the distribution of snow across the landscape (Anderson et al., 2014; Clark et al., 2011; López-Moreno and Stähli, 2008; Varhola et al., 2010). Previous efforts that used statistical approaches (e.g., binary regression trees) to identify the physiographic controls on snow depth surrounding a snow station determined both elevation and fractional vegetation to be major controls on snow depth variability (Meromy et al., 2013; Molotch and Bales, 2006). Rice and Bales (2010) attributed the inability of the Gin Flat snow course and snow pillow to represent larger areas to differences in the surrounding physiography. Assessing the role of specific landscape factors on relative snow depth could inform the likelihood of a site to be representative based on the surrounding physiography.

4.3.1 Influence of elevation

Snow depth generally increases with elevation due to increased precipitation and colder temperatures, except at the highest altitudes where wind redistribution is more significant (Grünewald et al., 2014). We found that relative elevation and RSD have significant correlations at scales greater than or equal to 0.5 km (Fig. 11a). The increasing correlation with scale is likely linked to a growing range of elevation values (i.e., complex mountainous terrain), which have an increased impact on relative snow depth (Fig. 11b). As scale increases, sites are more likely to have higher-magnitude relative elevation values, leading to higher-magnitude RSD values (and fewer representative sites).

The results show that the proportion of representative sites decreases with scale until plateauing between the 6 and 7 km scales (Fig. 6). The close matching of the representativeness curve (Fig. 6) to the R² curve (Fig. 11a) suggests that these relationships are closely linked. Within the range of scales we assessed in the available data, the larger the scale, the less likely an individual site is to be representative (until the 7 km scale). It is unclear why the proportion of representative sites stabilizes at the 7 km scale, but one possible explanation is that other local factors controlling areal-mean snow depth keep the impact of relative elevation on RSD from increasing further. It is important to note that high-magnitude relative elevation values are the primary cause of deteriorating representativeness at larger scales, not the scale itself. At the 4 km scale, relative elevation alters RSD by ∼16 cm per 100 m (Fig. 10). Thus, sites with high-magnitude relative elevation values could be adjusted using this slope to better represent the areal-mean snow depth. Note, however, that the slope (change in RSD per change in relative elevation) calculated here is a mean slope of all sites used in this study. Local factors impact the rate of snow depth change with elevation, so calculating a slope of relative elevation vs. RSD at an individual site would be a more accurate way to adjust RSD.
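
One way such a slope-based adjustment might look is sketched below. This is a hedged illustration and not a procedure defined in this study: it assumes relative elevation is the site elevation minus the areal-mean elevation, ignores the regression intercept, applies the study-wide 4 km slope rather than a site-specific one, and clips the result at zero. The function name and example values are hypothetical.

```python
# ~16 cm of RSD per 100 m of relative elevation at the 4 km scale (Fig. 10),
# expressed in meters of snow depth per meter of relative elevation.
SLOPE_M_PER_M = 0.16 / 100.0


def estimate_areal_mean(point_sd_m, relative_elev_m, slope=SLOPE_M_PER_M):
    """Estimate the areal-mean snow depth (m) from a point snow depth (m).

    relative_elev_m: site elevation minus areal-mean elevation (m); assumption.
    The expected RSD from elevation alone is removed from the point value.
    """
    expected_rsd_m = slope * relative_elev_m
    return max(point_sd_m - expected_rsd_m, 0.0)  # snow depth cannot be negative


# A station 150 m above the surrounding mean elevation reporting 2.0 m of snow.
high_site = estimate_areal_mean(2.0, 150.0)
# A station 100 m below the surrounding mean elevation reporting 1.0 m of snow.
low_site = estimate_areal_mean(1.0, -100.0)
```

Because the slope explains only part of the variance in RSD (R² = 0.3 at the 4 km scale), an adjustment of this kind should be read as a first-order correction at best.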

4.3.2 Influence of vegetation

Previous studies identified fractional vegetation as a major control on snow depth distribution (Meromy et al., 2013; Molotch and Bales, 2006). We found significant relationships between fractional vegetation and snow depth (i.e., the non-relative values) at 86 % of sites (at the 4 km scale) but found no significant relationships between relative fractional vegetation and relative snow depth. This indicates that vegetation does impact snow depth, but the relative metrics we employ are unable to capture this dynamic. The relationship between vegetation and snowpack is complex and nonlinear and (depending on climate) may shift within a single snow season (e.g., less deep snow in the forest in midwinter but deeper snow in the forest in the spring melt season) (Dickerson-Lange et al., 2021; Lundquist et al., 2013; Mazzotti et al., 2020; Bonner et al., 2022). Additionally, there may be different snow depth regimes within subcanopy zones and gaps in a forest (e.g., Currier and Lundquist, 2018). Given these factors, the relationship between fractional vegetation and snow depth is much more complex than the comparatively simple (and linear) lapse rate effects of elevation on temperature and precipitation.

Accurately simulating forest effects on snow cover also requires extremely high spatial resolutions (<5 m) (Clark et al., 2011; Mazzotti et al., 2021), which would not be captured by the 30 m fractional vegetation data set we employ. Additionally, we used relative fractional vegetation as the metric to describe site vegetation, which reduces vegetation dynamics to a single value. A single value may be insufficient to capture the complex dynamics of vegetation effects on snow. For example, an areal-mean fractional vegetation of 0.5 could represent either an area split into equal parts of 100 % and 0 % vegetation cover or a homogeneous area with 50 % vegetation cover. The impact of vegetation on snow distribution at these two example sites could be considerably different, but the areal-mean value is unable to convey the difference in vegetation distribution between the sites. An analysis of the high-resolution spatial distribution of vegetation involving the distribution of forest gaps would conceivably reveal the influence of vegetation on relative snow depth but is beyond the scope of this paper.
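
The limitation of an areal-mean vegetation value can be illustrated numerically. The sketch below builds the two hypothetical areas described above (one split evenly between 100 % and 0 % cover, one with uniform 50 % cover) and shows that they share the same mean while differing in spread. The pixel counts and the helper name are assumptions made for illustration.

```python
from statistics import mean, pstdev


def describe(fractional_cover):
    """Return (areal mean, population standard deviation) of fractional cover."""
    return mean(fractional_cover), pstdev(fractional_cover)


# Area split into equal parts of full and zero vegetation cover.
patchy = [1.0] * 50 + [0.0] * 50
# Homogeneous area with 50 % vegetation cover in every pixel.
homogeneous = [0.5] * 100

patchy_mean, patchy_spread = describe(patchy)
homog_mean, homog_spread = describe(homogeneous)
```

Both areas have a mean of 0.5, but only a measure of spatial distribution, here the standard deviation, distinguishes them. This is the kind of information a single relative fractional vegetation value discards.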

4.3.3 Influence of southness

It is well documented that slope and aspect impact snow distribution (e.g., Golding and Swanson, 1986; Murray and Buttle, 2003). We similarly found significant relationships between southness and snow depth at 93 % of sites (at the 4 km scale) but no significant relationships between relative southness and relative snow depth. One explanation for the lack of significant relationship is that snow station southness is not different enough from the surrounding area to impact snow depth. Snow stations are strategically placed on flat areas, which could reduce the influence of relative southness. It is possible that other landscape factors outweigh the impact of southness on snow depth, making its impact more difficult to ascertain. More complex analyses that take multiple variables into account may be required to determine the relative importance of landscape variables on relative snow depth.

5 Conclusions

We analyzed snow depth distributions surrounding snow stations at three scales using coincident lidar–snow station data in Colorado and California from 2021–2023. Snow stations (station SDs) record snow depths within ±10 cm of the areal-mean snow depth in approximately one-third of cases at all three scales, while overestimating the areal-mean snow depth by more than 10 cm in ∼50 % of cases. When relative snow depth is calculated using 50 m SD, the frequency of site representation increases to ∼50 % at the 0.5 and 1 km scales. Representativeness increases when using 50 m SD because snow station locations record snow depths that are on average ∼10 cm greater than the surrounding 50 m area. This high bias needs to be considered when using snow station data for validation. Representativeness decreases with scale because relative elevation magnitudes increase, causing lapse rates to impact relative snow depth via changes in areal-mean snow depth. The directional bias of RSD at a snow station is consistent from survey to survey. Together, these results suggest there is an opportunity to increase the utility of snow stations for model validation and ground truthing. Future work should focus on determining the underlying influences that cause site bias, potentially allowing for a priori identification of a site's relationship with the surrounding area. Adjusting snow station data based on the consistent high bias compared to the surrounding 50 m area or based on the typical trend of RSD would improve the ability of a snow station to represent the surrounding area, particularly at scales of 1 km or less.

Data availability

Land cover and topography data were provided by the United States Geological Survey. Snow station data were provided by the USDA NRCS and the CA-DWR (https://wcc.sc.egov.usda.gov/reportGenerator/, USDA NRCS, 2023; https://cdec.water.ca.gov/dynamicapp/selectSnow, California Department of Water Resources, 2023). Lidar data were provided by ASO, Inc. (https://www.airbornesnowobservatories.com/, Painter et al., 2016).

Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/tc-18-3495-2024-supplement.

Author contributions

JNH carried out the analyses, created the figures, and wrote the manuscript. EES and MSR helped design the experiments and edit the figures and manuscript text.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

This work was supported by the National Aeronautics and Space Administration (NASA) through its Water Resources Program and Terrestrial Hydrology Program. We would like to express our gratitude to NASA for their financial support and commitment to advancing research in water resources and terrestrial hydrology. We would also like to extend our thanks to the University of Colorado Boulder and Oregon State University, as well as the various other institutions and organizations that provided invaluable data and resources for this investigation. Their contributions were essential to the success of this research.

Financial support

This research has been supported by the National Aeronautics and Space Administration (grant nos. 80NSSC22K0928, 80NSSC22K0685, and NNX17AL41G).

Review statement

This paper was edited by Franziska Koch and reviewed by Wyatt Reis and Hannah Besso.

References

Anderson, B. T., McNamara, J. P., Marshall, H.-P., and Flores, A. N.: Insights into the physical processes controlling correlations between snow distribution and terrain properties, Water Resour. Res., 50, 4545–4563, https://doi.org/10.1002/2013WR013714, 2014. 

Barrett, A. P.: National Operational Hydrologic Remote Sensing Center SNOw Data Assimilation System (SNODAS) Products at NSIDC, NSIDC Special Report 11, Boulder, CO, USA, National Snow and Ice Data Center, 2003. 

Blankinship, J. C., Meadows, M. W., Lucas, R. G., and Hart, S. C.: Snowmelt timing alters shallow but not deep soil moisture in the Sierra Nevada, Water Resour. Res., 50, 1448–1456, https://doi.org/10.1002/2013WR014541, 2014. 

Blöschl, G.: Scaling issues in snow hydrology, Hydrol. Process., 13, 2149–2175, https://doi.org/10.1002/(SICI)1099-1085(199910)13:14/15<2149::AID-HYP847>3.0.CO;2-8, 1999. 

Bonnell, R., McGrath, D., Hedrick, A. R., Trujillo, E., Meehan, T. G., Williams, K., Marshall, H.-P., Sexstone, G., Fulton, J., Ronayne, M. J., Fassnacht, S. R., Webb, R. W., and Hale, K. E.: Snowpack relative permittivity and density derived from near-coincident lidar and ground-penetrating radar, Hydrol. Process., 37, e14996, https://doi.org/10.1002/hyp.14996, 2023. 

Bonner, H. M., Smyth, E., Raleigh, M. S., and Small, E. E.: A Meteorology and Snow Data Set From Adjacent Forested and Meadow Sites at Crested Butte, CO, USA, Water Resour. Res., 58, e2022WR033006, https://doi.org/10.1029/2022WR033006, 2022. 

Broxton, P. D., van Leeuwen, W. J. D., and Biederman, J. A.: Improving Snow Water Equivalent Maps With Machine Learning of Snow Survey and Lidar Measurements, Water Resour. Res., 55, 3739–3757, https://doi.org/10.1029/2018WR024146, 2019. 

California Department of Water Resources: Historical snow sensor data, California Data Exchange Center [data set], https://cdec.water.ca.gov/dynamicapp/selectSnow, last access: 1 October 2023.

Clark, M. P., Hendrikx, J., Slater, A. G., Kavetski, D., Anderson, B., Cullen, N. J., Kerr, T., Örn Hreinsson, E., and Woods, R. A.: Representing spatial variability of snow water equivalent in hydrologic and land-surface models: A review, Water Resour. Res., 47, W07539, https://doi.org/10.1029/2011WR010745, 2011. 

Currier, W. R. and Lundquist, J. D.: Snow Depth Variability at the Forest Edge in Multiple Climates in the Western United States, Water Resour. Res., 54, 8756–8773, https://doi.org/10.1029/2018WR022553, 2018. 

DeChant, C. M. and Moradkhani, H.: Improving the characterization of initial condition for ensemble streamflow prediction using data assimilation, Hydrol. Earth Syst. Sci., 15, 3399–3410, https://doi.org/10.5194/hess-15-3399-2011, 2011. 

Dewitz, J.: National Land Cover Database, United States Geological Survey, https://doi.org/10.5066/P9KZCM54, 2021. 

Dickerson-Lange, S. E., Vano, J. A., Gersonde, R., and Lundquist, J. D.: Ranking Forest Effects on Snow Storage: A Decision Tool for Forest Management, Water Resour. Res., 57, e2020WR027926, https://doi.org/10.1029/2020WR027926, 2021. 

Dierauer, J. R., Allen, D. M., and Whitfield, P. H.: Snow Drought Risk and Susceptibility in the Western United States and Southwestern Canada, Water Resour. Res., 55, 3076–3091, https://doi.org/10.1029/2018WR023229, 2019. 

Dollery, R., Hodkinson, I. D., and Jónsdóttir, I. S.: Impact of warming and timing of snow melt on soil microarthropod assemblages associated with Dryas-dominated plant communities on Svalbard, Ecography, 29, 111–119, https://doi.org/10.1111/j.2006.0906-7590.04366.x, 2006. 

Dozier, J. and Frew, J.: Rapid calculation of terrain parameters for radiation modeling from digital elevation data, IEEE T. Geosci. Remote, 28, 963–969, https://doi.org/10.1109/36.58986, 1990. 

Dozier, J., Bair, E. H., and Davis, R. E.: Estimating the spatial distribution of snow water equivalent in the world's mountains, WIREs Water, 3, 461–474, https://doi.org/10.1002/wat2.1140, 2016. 

Fassnacht, S. R., Brown, K. S. J., Blumberg, E. J., López Moreno, J. I., Covino, T. P., Kappas, M., Huang, Y., Leone, V., and Kashipazha, A. H.: Distribution of snow depth variability, Front. Earth Sci., 12, 683–692, https://doi.org/10.1007/s11707-018-0714-z, 2018. 

Gesch, D. B., Evans, G. A., Oimoen, M. J., and Arundel, S.: The National Elevation Dataset, American Society for Photogrammetry and Remote Sensing, 83–110, 2018.

Golding, D. L. and Swanson, R. H.: Snow distribution patterns in clearings and adjacent forest, Water Resour. Res., 22, 1931–1940, https://doi.org/10.1029/WR022i013p01931, 1986. 

Grünewald, T. and Lehning, M.: Altitudinal dependency of snow amounts in two small alpine catchments: can catchment-wide snow amounts be estimated via single snow or precipitation stations?, Ann. Glaciol., 52, 153–158, https://doi.org/10.3189/172756411797252248, 2011. 

Grünewald, T. and Lehning, M.: Are flat-field snow depth measurements representative? A comparison of selected index sites with areal snow depth measurements at the small catchment scale, Hydrol. Process., 29, 1717–1728, https://doi.org/10.1002/hyp.10295, 2015. 

Grünewald, T., Bühler, Y., and Lehning, M.: Elevation dependency of mountain snow depth, The Cryosphere, 8, 2381–2394, https://doi.org/10.5194/tc-8-2381-2014, 2014. 

Klein, A. G. and Barnett, A. C.: Validation of daily MODIS snow cover maps of the Upper Rio Grande River Basin for the 2000–2001 snow year, Remote Sens. Environ., 86, 162–176, https://doi.org/10.1016/S0034-4257(03)00097-X, 2003. 

Lettenmaier, D. P., Alsdorf, D., Dozier, J., Huffman, G. J., Pan, M., and Wood, E. F.: Inroads of remote sensing into hydrologic science during the WRR era, Water Resour. Res., 51, 7309–7342, https://doi.org/10.1002/2015WR017616, 2015. 

Li, D., Wrzesien, M. L., Durand, M., Adam, J., and Lettenmaier, D. P.: How much runoff originates as snow in the western United States, and how will that change in the future?, Geophys. Res. Lett., 44, 6163–6172, https://doi.org/10.1002/2017GL073551, 2017. 

Lievens, H., Brangers, I., Marshall, H.-P., Jonas, T., Olefs, M., and De Lannoy, G.: Sentinel-1 snow depth retrieval at sub-kilometer resolution over the European Alps, The Cryosphere, 16, 159–177, https://doi.org/10.5194/tc-16-159-2022, 2022. 

Liston, G. E.: Interrelationships among Snow Distribution, Snowmelt, and Snow Cover Depletion: Implications for Atmospheric, Hydrologic, and Ecologic Modeling, J. Appl. Meteorol., 38, 1474–1487, https://doi.org/10.1175/1520-0450(1999)038<1474:IASDSA>2.0.CO;2, 1999.

López-Moreno, J. I. and Stähli, M.: Statistical analysis of the snow cover variability in a subalpine watershed: Assessing the role of topography and forest interactions, J. Hydrol., 348, 379–394, https://doi.org/10.1016/j.jhydrol.2007.10.018, 2008. 

López-Moreno, J. I., Fassnacht, S. R., Beguería, S., and Latron, J. B. P.: Variability of snow depth at the plot scale: implications for mean depth estimation and sampling strategies, The Cryosphere, 5, 617–629, https://doi.org/10.5194/tc-5-617-2011, 2011. 

Lundquist, J. D., Dickerson-Lange, S. E., Lutz, J. A., and Cristea, N. C.: Lower forest density enhances snow retention in regions with warmer winters: A global framework developed from plot-scale observations and modeling, Water Resour. Res., 49, 6356–6370, https://doi.org/10.1002/wrcr.20504, 2013. 

Margulis, S. A., Fang, Y., Li, D., Lettenmaier, D. P., and Andreadis, K.: The Utility of Infrequent Snow Depth Images for Deriving Continuous Space-Time Estimates of Seasonal Snow Water Equivalent, Geophys. Res. Lett., 46, 5331–5340, https://doi.org/10.1029/2019GL082507, 2019. 

Mazzotti, G., Essery, R., Moeser, C. D., and Jonas, T.: Resolving Small-Scale Forest Snow Patterns Using an Energy Balance Snow Model With a One-Layer Canopy, Water Resour. Res., 56, e2019WR026129, https://doi.org/10.1029/2019WR026129, 2020. 

Mazzotti, G., Webster, C., Essery, R., and Jonas, T.: Increasing the Physical Representation of Forest-Snow Processes in Coarse-Resolution Models: Lessons Learned From Upscaling Hyper-Resolution Simulations, Water Resour. Res., 57, e2020WR029064, https://doi.org/10.1029/2020WR029064, 2021. 

Meehan, T. G., Hojatimalekshah, A., Marshall, H.-P., Deeb, E. J., O'Neel, S., McGrath, D., Webb, R. W., Bonnell, R., Raleigh, M. S., Hiemstra, C., and Elder, K.: Spatially distributed snow depth, bulk density, and snow water equivalent from ground-based and airborne sensor integration at Grand Mesa, Colorado, USA, The Cryosphere Discuss. [preprint], https://doi.org/10.5194/tc-2023-141, in review, 2023. 

Meromy, L., Molotch, N. P., Link, T. E., Fassnacht, S. R., and Rice, R.: Subgrid variability of snow water equivalent at operational snow stations in the western USA, Hydrol. Process., 27, 2383–2400, https://doi.org/10.1002/hyp.9355, 2013. 

Molotch, N. P. and Bales, R. C.: Scaling snow observations from the point to the grid element: Implications for observation network design, Water Resour. Res., 41, 1–16, https://doi.org/10.1029/2005WR004229, 2005. 

Molotch, N. P. and Bales, R. C.: SNOTEL representativeness in the Rio Grande headwaters on the basis of physiographics and remotely sensed snow cover persistence, Hydrol. Process., 20, 723–739, https://doi.org/10.1002/hyp.6128, 2006. 

Molotch, N. P., Colee, M. T., Bales, R. C., and Dozier, J.: Estimating the spatial distribution of snow water equivalent in an alpine basin using binary regression tree models: The impact of digital elevation data and independent variable selection, Hydrol. Process., 19, 1459–1479, https://doi.org/10.1002/hyp.5586, 2005. 

Murray, C. D. and Buttle, J. M.: Impacts of clearcut harvesting on snow accumulation and melt in a northern hardwood forest, J. Hydrol., 271, 197–212, https://doi.org/10.1016/S0022-1694(02)000352-9, 2003. 

Musselman, K. N., Lehner, F., Ikeda, K., Clark, M. P., Prein, A. F., Liu, C., Barlage, M., and Rasmussen, R.: Projected increases and shifts in rain-on-snow flood risk over western North America, Nat. Clim. Change, 8, 808–812, https://doi.org/10.1038/s41558-018-0236-4, 2018. 

NRCS: Part 622 Snow Survey and Water Supply Forecasting National Engineering Handbook, 210-VI-NEH, Amend. 43, July 2011, https://directives.sc.egov.usda.gov/landingpage/82fbec53-5b08-4441-ba9e-47b4193a96f1 (last access: 31 July 2024), 2011. 

Pagano, T. C., Garen, D. C., Perkins, T. R., and Pasteris, P. A.: Daily Updating of Operational Statistical Seasonal Water Supply Forecasts for the western U.S., J. Am. Water Resour. A., 45, 767–778, https://doi.org/10.1111/j.1752-1688.2009.00321.x, 2009.

Painter, T. H., Berisford, D. F., Boardman, J. W., Bormann, K. J., Deems, J. S., Gehrke, F., Hedrick, A., Joyce, M., Laidlaw, R., Marks, D., Mattmann, C., McGurk, B., Ramirez, P., Richardson, M., Skiles, S. M. K., Seidel, F. C., and Winstral, A.: The Airborne Snow Observatory: Fusion of scanning lidar, imaging spectrometer, and physically-based modeling for mapping snow water equivalent and snow albedo, Remote Sens. Environ., 184, 139–152, https://doi.org/10.1016/j.rse.2016.06.018, 2016 (data available at: https://www.airbornesnowobservatories.com/, last access: 1 October 2023). 

Pan, M., Sheffield, J., Wood, E. F., Mitchell, K. E., Houser, P. R., Schaake, J. C., Robock, A., Lohmann, D., Cosgrove, B., Duan, Q., Luo, L., Higgins, R. W., Pinker, R. T., and Tarpley, J. D.: Snow process modeling in the North American Land Data Assimilation System (NLDAS): 1. Evaluation of model simulated snow water equivalent, J. Geophys. Res.-Atmos., 108, 8850, https://doi.org/10.1029/2003jd003994, 2003. 

Pflug, J. M. and Lundquist, J. D.: Inferring Distributed Snow Depth by Leveraging Snow Pattern Repeatability: Investigation Using 47 Lidar Observations in the Tuolumne Watershed, Sierra Nevada, California, Water Resour. Res., 56, e2020WR027243, https://doi.org/10.1029/2020WR027243, 2020. 

Qin, Y., Abatzoglou, J. T., Siebert, S., Huning, L. S., AghaKouchak, A., Mankin, J. S., Hong, C., Tong, D., Davis, S. J., and Mueller, N. D.: Agricultural risks from changing snowmelt, Nat. Clim. Change, 10, 459–465, https://doi.org/10.1038/s41558-020-0746-8, 2020. 

Raleigh, M. S. and Small, E. E.: Snowpack density modeling is the primary source of uncertainty when mapping basin-wide SWE with lidar, Geophys. Res. Lett., 44, 3700–3709, https://doi.org/10.1002/2016GL071999, 2017. 

Rice, R. and Bales, R. C.: Embedded-sensor network design for snow cover measurements around snow pillow and snow course sites in the Sierra Nevada of California, Water Resour. Res., 46, 3, https://doi.org/10.1029/2008WR007318, 2010. 

Riggs, G. and Hall, D.: Continuity of MODIS and VIIRS Snow Cover Extent Data Products for Development of an Earth Science Data Record, Remote Sens., 12, 3781, https://doi.org/10.3390/rs12223781, 2020. 

Schneider, D. and Molotch, N. P.: Real-time estimation of snow water equivalent in the Upper Colorado River Basin using MODIS-based SWE Reconstructions and SNOTEL data, Water Resour. Res., 52, 7892–7910, https://doi.org/10.1002/2016WR019067, 2016.

Scipión, D. E., Mott, R., Lehning, M., Schneebeli, M., and Berne, A.: Seasonal small-scale spatial variability in alpine snowfall and snow accumulation, Water Resour. Res., 49, 1446–1457, https://doi.org/10.1002/wrcr.20135, 2013.  

Slater, A. G. and Clark, M. P.: Snow data assimilation via an ensemble Kalman filter, J. Hydrometeorol., 7, 478–493, https://doi.org/10.1175/JHM505.1, 2006.

Smyth, E. J., Raleigh, M. S., and Small, E. E.: Improving SWE Estimation With Data Assimilation: The Influence of Snow Depth Observation Timing and Uncertainty, Water Resour. Res., 56, e2019WR026853, https://doi.org/10.1029/2019WR026853, 2020. 

USDA NRCS: Snow-Telemetry daily snow depth dataset, USDA Natural Resources Conservation Service [data set], https://wcc.sc.egov.usda.gov/reportGenerator/, last access: 1 October 2023. 

Varhola, A., Coops, N. C., Weiler, M., and Moore, R. D.: Forest canopy effects on snow accumulation and ablation: An integrative review of empirical results, J. Hydrol., 392, 219–233, https://doi.org/10.1016/j.jhydrol.2010.08.009, 2010. 

Watson, F. G. R., Anderson, T. N., Newman, W. B., Alexander, S. E., and Garrott, R. A.: Optimal sampling schemes for estimating mean snow water equivalents in stratified heterogeneous landscapes, J. Hydrol., 328, 432–452, https://doi.org/10.1016/j.jhydrol.2005.12.032, 2006. 

Westerling, A. L., Hidalgo, H. G., Cayan, D. R., and Swetnam, T. W.: Warming and earlier spring increase Western U.S. forest wildfire activity, Science, 313, 940–943, https://doi.org/10.1126/science.1128834, 2006. 

Wetlaufer, K., Hendrikx, J., and Marshall, L.: Spatial heterogeneity of snow density and its influence on snow water equivalence estimates in a large mountainous basin, Hydrology, 3, 3, https://doi.org/10.3390/hydrology3010003, 2016. 

Woelders, L., Lukas, J., Payton, E., and Duncan, B.: Snowpack Monitoring in the Rocky Mountain West: A User Guide, Western Water Assessment, http://wwa.colorado.edu/publications/reports (last access: 31 July 2024), 2020.

Short summary
Automated stations measure snow properties at a single point but are frequently used to validate data that represent much larger areas. We use lidar snow depth data to see how often the mean snow depth surrounding a snow station is within 10 cm of the snow station depth at different scales. We find that snow station depths exceed the area-mean snow depth by at least 10 cm in ~ 50 % of cases, but the direction of bias at a site is temporally consistent, suggesting a site could be calibrated to the surrounding area.
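The comparison described in the summary can be illustrated with a minimal sketch. It is not the authors' code: the function names `station_bias` and `is_representative`, the square averaging window, and the synthetic grid are all assumptions made for illustration, and the pixel at the station location stands in for the station measurement. Only the 10 cm tolerance and the 50 m pixel size are taken from the text.

```python
import numpy as np

def station_bias(depth_m, row, col, pixel_size_m, scale_m):
    """Depth at the station pixel minus the mean depth of a square
    window of roughly scale_m on a side, centred on that pixel.
    The square window is an assumption of this sketch."""
    half = int(scale_m // (2 * pixel_size_m))  # half-width in pixels
    r0, r1 = max(row - half, 0), min(row + half + 1, depth_m.shape[0])
    c0, c1 = max(col - half, 0), min(col + half + 1, depth_m.shape[1])
    # nanmean ignores pixels without a valid depth (stored as NaN)
    return float(depth_m[row, col] - np.nanmean(depth_m[r0:r1, c0:c1]))

def is_representative(bias_m, tolerance_m=0.10):
    """True when the station is within 10 cm of the areal mean."""
    return abs(bias_m) <= tolerance_m

# Synthetic 50 m grid (hypothetical data): 1.0 m of snow everywhere,
# with a 5 x 5 pixel pocket of 1.3 m around the station at (40, 40).
grid = np.full((81, 81), 1.0)
grid[38:43, 38:43] = 1.3

scales_m = [200, 500, 1000, 4000]
biases = {s: station_bias(grid, 40, 40, 50, s) for s in scales_m}
flags = {s: is_representative(b) for s, b in biases.items()}
```

In this synthetic case the station matches its immediate surroundings but sits above the areal mean at the larger scales, so the positive bias grows as the window widens. With real data, the grid would come from a lidar snow depth raster and the station value from the station record.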