The optical characteristics and sources of chromophoric dissolved organic matter ( CDOM ) in seasonal snow of northwestern China

Chromophoric dissolved organic matter (CDOM) plays an important role in the global carbon cycle and energy budget but is rarely studied in seasonal snow. A field campaign was conducted across northwestern China from January to February 2012, and surface snow samples were collected at 39 sites in Xinjiang and Qinghai provinces. Absorption and fluorescence spectroscopies, along with chemical analysis, were used to investigate the optical characteristics and potential sources of CDOM in seasonal snow. The abundance of CDOM, shown as the absorption coefficient at 280 nm, aCDOM(280), and the spectral slope from 275 to 295 nm (S275−295) ranged from 0.15 to 10.57 m−1 and 0.0129 to 0.0389 nm−1. The highest average aCDOM(280) (2.30±0.52 m−1)was found in Qinghai, and the lowest average S275−295 (0.0188±0.0015 nm−1) indicated that the snow CDOM in this region had a strongly terrestrial characteristic. The lower values of aCDOM(280) were found at sites located to the north of the Tianshan Mountains and northwestern Xinjiang along the border of China (0.93±0.68 m−1 and 0.80± 0.62 m−1). Parallel factor (PARAFAC) analysis identified three types of fluorophores that were attributed to two humic-like substances (HULIS, C1 and C2) and one proteinlike material (C3). C1 was mainly from soil HULIS, C3 was a type of autochthonously labile organic matter, while the potential sources of C2 were complex, including soil, microbial activity, anthropogenic pollution, and biomass burning. Furthermore, the regional variations of sources for snow CDOM were assessed by analyses of chemical species (e.g., soluble ions), fluorescent components, and air mass backward trajectories combined with satellite-derived active-fire locations.


Introduction
Dissolved organic matter (DOM) is widely distributed in natural aquatic ecosystems and plays a key role in the global carbon cycle (Massicotte et al., 2017).Chromophoric dissolved organic matter (CDOM), widely known as the lightabsorbing constituent of DOM, can absorb light from ultraviolet to visible (UV-vis) wavelengths (Bricaud et al., 1981).Owing to its light-absorbing properties, CDOM is important in biological processes (Seekell et al., 2015;Thrane et al., 2014), photochemical processes (Helms et al., 2013;Vaehaetalo and Wetzel, 2004), and the energy budget (Hill and Zimmerman, 2016;Pegau, 2002) in natural water bodies.
Compared to the aquatic environments, there were only limited studies evaluating DOM in the cryosphere.Whereas the global glacier ecosystem is a large organic carbon pool and exports approximately 1.04±0.18TgC yr −1 of dissolved organic carbon into freshwater and marine environments (Hood et al., 2015).In addition, the glacier-derived DOM shows high bioavailability and can be a source of labile organic matter for downstream ecosystems (Hood et al., 2009;Lawson et al., 2014;Singer et al., 2012).The DOM in snow and ice originates from in situ processes (autochthonous) such as microbial activity (Anesio et al., 2009) and is imported from the surrounding terrestrial environments (allochthonous), including soil, vegetation (Bhatia et al., 2010), and anthropogenic activity (Stubbins et al., 2012).
Snowfall is an important carbon and nutrient input for land ecosystems (Mladenov et al., 2012) and a crucial freshwater reservoir (Jones, 1999).Snowpack is also an active field for photochemical (Beine et al., 2011;Domine et al., 2013) and biological processes (Liu et al., 2009;Lutz et al., 2016).Unlike the aquatic environments, high surface albedo is the most obvious physical property of snow (IPCC, 2013).Once light-absorbing impurities are deposited on the snow surface, the albedo can be significantly reduced, and the regional and global climate are further affected (Hadley and Kirchstetter, 2012).Several field campaigns covering the Arctic, Russia, North America, and northern China have been conducted to measure insoluble light-absorbing particles (ILAPs) in snow, for instance black carbon (BC), insoluble organic carbon (ISOC), and mineral dust (MD) (Doherty et al., 2010(Doherty et al., , 2014(Doherty et al., , 2015;;Huang et al., 2011;Pu et al., 2017;Wang et al., 2013Wang et al., , 2015Wang et al., , 2017;;Warren and Wiscombe, 1980;Ye et al., 2012;Y. Zhou et al., 2017).However, these studies neglected CDOM, which is rarely studied in snow but has been proven to be an effective light absorber whether in the atmosphere (i.e., brown carbon, BrC) (Hecobian et al., 2010) or in water bodies (Bricaud et al., 1981).To constrain the photochemistry of snow soluble chromophores, Anastasio and Robles (2007) first quantified the light absorption of dissolved chromophores in melted snow samples from the Arctic and Antarctica.They found that in addition to NO − 3 and H 2 O 2 , approximately half of the light absorption at 280 nm and above was due to unknown chromophores, probably organics.After that, Beine et al. (2011) analyzed more than 500 snow samples collected in Alaska.They exhibited slight contributions of H 2 O 2 and NO − 3 to the total absorption within 300-450 nm (combined < 9 %), while humic-like substances (HULIS), which are a type of macromolecular organic matter defined for aerosol with certain similar chemical properties to terrestrial and aquatic humic and fulvic substances (Graber and Rudich, 2006), and unknown chromophores each accounted for approximately half of the total absorption.Recently, several studies have started to focus on the optical properties and radiative forcing of CDOM in glaciers on the Tibetan Plateau.Yan et al. (2016) measured the mass absorption cross section (MAC) of CDOM in snow (1.4 ± 0.4 m 2 g −1 at 365 nm) at Laohugou Glacier, northern Tibetan Plateau, and further calculated the radiative forcing of CDOM, which accounted for approximately 10 % relative to that of BC.Niu et al. (2018) showed quite a high MAC value of CDOM (6.31 ± 0.34 m 2 g −1 at 365 nm) in snow and ice samples collected on Mt.Yulong, southeastern Tibetan Plateau.Moreover, it is surprising that the light absorption of CDOM within 330 to 400 nm was approximately 4 times higher than that of BC, although with high uncertainty.In above studies, CDOM showed significant effects on the energy budget of surface snow and ice on glaciers.Until now, the study of CDOM in snow and ice is still in its infancy, and much more work is imperative to improve our understanding of it.In northern China, snowpack is affected by more anthropogenic activities or sunlight than those at higher elevation or latitude, thus the effects of CDOM may be more remarkable.Therefore, we conducted a large field campaign to investigate the CDOM in seasonal snow of northwestern China from January to February 2012.
UV-vis absorption and fluorescence spectroscopies are both rapid and effective methods for characterizing the optical properties and sources of CDOM.The absorption coefficient at a certain wavelength within the UV band, for instance, 254, 280, or 350 nm (Spencer et al., 2012;Zhang et al., 2010Zhang et al., , 2011)), usually serves as an indicator of CDOM abundance.The absorption spectrum of CDOM decreases approximately exponentially with increasing wavelength (Helms et al., 2008), and is usually described by the spectral slope (S) (Twardowski et al., 2004).Helms et al. (2008) used the spectral slope between 275 and 295 nm (S 275−295 ) to investigate the molecular weight and sources of CDOM (terrestrial or marine origin), i.e., lower S 275−295 corresponds to CDOM with a higher molecular weight and a more obviously terrestrial characteristic.Fluorescence excitation-emission matrix (EEM) has been widely used to identify the sources and compositions (humic-like or protein-like) of fluorescent DOM (FDOM) in natural waterbodies (Birdwell and Engel, 2010;Coble, 1996;Zhao et al., 2016), rainwater (Y.Q. Zhou et al., 2017), fog water (Birdwell and Valsaraj, 2010), and aerosols (Duarte et al., 2004;Lee et al., 2013;Mladenov et al., 2011).To precisely extract useful information from the large data set of EEMs, Bro (1997) successfully applied parallel factor (PARAFAC) analysis to decompose EEMs into several independent fluorescent components.Due to the great advantages of PARAFAC analysis in interpreting the results of EEMs, this has been the mainstream approach in recent natural CDOM studies (Murphy et al., 2013).However, the application of EEM combined with PARAFAC analysis in the cryosphere is scarce.Therefore, we try to employ it to characterize the snow CDOM.
In this study, for the first time, with the aim of presenting a comprehensive understanding of CDOM in seasonal snow across northwestern China, UV-vis absorption and fluorescence spectroscopies along with chemical analysis were applied to investigate the abundances, optical properties, and potential sources of CDOM as well as their spatial distributions.

Sample collection
During January to February 2012, snow samples were collected at 7 sites in Qinghai and 32 sites in Xinjiang, northwestern China.The distribution of sample sites, which are numbered chronologically, is shown in Fig. 1.Based on Pu et al. (2017), these sites were separated into five regions by their geographical distribution to investigate the spatial variations of light absorption and fluorescence properties, as well as the potential sources of CDOM.Region 1 is in the southeastern part of Qinghai at high altitude, and other regions are The Cryosphere, 13, 157-175, 2019 in Xinjiang.Region 2 is along the Tianshan Mountains; region 3 is located to the north of the Tianshan Mountains and close to the industrial city belt in central Xinjiang.Regions 4 and 5 are in northwestern and northeastern Xinjiang, and both are along the border of China.
The sample sites were chosen to be upwind and far enough away from roads, railways, cities, and villages to minimize the effects of local pollution.Hence, the collected samples can be representative of a wide range of areas.Pictures of several sample sites are shown in Fig. 2. Snow samples were collected every 5 cm from top to bottom at each site.If there was a melt layer or fresh snow on the top layer, such a sample was collected individually.A pair of two adjacent vertical profiles of snow (left and right samples) to assess the variability of the same snowpack and to enhance the accuracy of the measurements.During this campaign, 13 fresh snow samples that had fallen during the sampling time were collected.In addition, at some sites, the snow was thin and patchy and the wind was strong; hence, these samples were gathered from snow drifts and were potentially influenced by the deposition of local soil dust (Ye et al., 2012).More de- tails on the sampling methods have been reported previously (Doherty et al., 2010;Wang et al., 2013;Ye et al., 2012).
After being returned to the laboratory in Lanzhou University, all the samples were stored in a freezer at −20 • C or lower for subsequent analyses.However, some previous studies indicated that the freeze-thaw process may lead to biases of the optical properties for DOM samples.For instance, Fellman et al. (2008) reported that there was a decrease in specific ultraviolet absorbance (SUVA) for stream water DOM after frozen, with a median of approximately 8 %.A study of peatland DOM found that the change in light absorption at 254 nm after freeze and thaw was less than 5 % of the median (Peacock et al., 2015).Thieme et al. (2016) assessed the changes in fluorescence properties for several types of DOM sample.The results showed the decreased relative percentages of terrestrial humic-like fluorophores (−3 % on average) and humification index (HIX, −2 % on average), and the increased percentage of fluviclike fluorophore (+6 % on average).Other studies have also shown that the optical properties (light absorption and fluorescence) of several types of DOM were not affected significantly by freezing, such as those in ocean water, pore water, spring, and cave water (Birdwell and Engel, 2010;Del Castillo and Coble, 2000;Otero et al., 2007;Yamashita et al., 2010).As discussed above, the freeze-thaw process may influence the relative contributions of PARAFAC components slightly, while the effects on a CDOM (280) and fluorescence indices can be neglected.The EEMs (n = 78) of surface snow samples were measured by an Aqualog spectrofluorometer system (Horiba Scientific, NJ, USA) in a 1 cm quartz cell.The scanning ranges were 240 to 600 nm in 5 nm intervals for excitation and 250 to 825 nm in 4.65 nm (8 pixels) intervals for emission, with the integrating time of 5 s.An ultrapure water blank was subtracted to remove the water Raman scatter peaks.
The inner filter effect (IFE) of EEM was corrected using the method shown in Kothawala et al. (2013).The fluorescence intensities were calibrated by the Raman peak of ultrapure water reference at a 350 nm excitation wavelength following the method presented by Lawaetz and Stedmon (2009).The Rayleigh scatter peaks were addressed by the EEMscat MATLAB toolbox (version 3) using an interpolation algorithm (Bahram et al., 2006).
PARAFAC is a multi-way method for modeling the data with three-or higher-order arrays (Murphy et al., 2013).For EEMs, the three dimensions are samples, excitation, and emission wavelengths.PARAFAC analysis can decompose the EEMs into several components with clear chemical interpretations.The details about the theory of PARAFAC analysis can be found in the Supplement.In this study, PARAFAC analysis was performed using the DOMFluor toolbox (version 1.7, Stedmon and Bro, 2008) in MATLAB.In addition, because the emission signals were mainly in the range of 250-650 nm, those at longer wavelengths were weak and more likely to be noises; hence, the emission wavelengths longer than 650 nm were not considered in the model.According to the analysis of residual error, splithalf method, and visual inspection, the three-component PARAFAC model was selected.The residual error decreased distinctly when the component number increased from two to three and from four to five (Fig. S1).Combined with the split-half analysis for 2-to 7-component models, only 2-and 3-component models were validated with the S 4 C 4 T 2 split scheme (Murphy et al., 2013).Therefore, the 3-component model was chosen here.The fluorescence intensity of each fluorescent component was expressed as F max in Raman unit (RU) (Stedmon and Markager, 2005b).The relative contributions of intensities for three components to the total fluorescence are given as %C1-%C3 hereinafter.In addition, three fluorescence-derived indices are widely used to identify the potential sources of CDOM.Zsolnay et al. (1999) presented HIX to describe the relative humification of DOM.The fluorescence index (FI) is used to identify the sources of DOM from terrestrial or microbial origins (McKnight et al., 2001), and the biological index (BIX) can be an indicator of autochthonous productivity (Huguet et al., 2009) where I is the fluorescence intensity, and Ex and Em are short for the excitation and emission wavelengths.We note that the wavelengths used in the calculation were changed slightly (1 nm or less) due to different instruments.

UV-vis absorption measurement
The UV-vis absorption spectra (n = 78) of snow samples were derived from 240 to 600 nm in 5 nm intervals, while the fluorescence measurements were conducted by an Aqualog spectrofluorometer system, and an ultrapure water blank was used as a reference.The absorbance of CDOM was assumed to be zero above 550 nm, and the average absorbance between 550 and 600 nm was subtracted from the whole spectrum to correct the baseline shifts and scattering effects of the measurement.The absorbances of samples were converted to absorption coefficients using the following equation: where A is the absorbance, λ is the wavelength, L is the path length of cuvette (0.01 m), and a CDOM is the absorption coefficient (m −1 ).The abundance of CDOM is presented by the absorption coefficient at 280 nm, a CDOM (280) (Zhang et al., 2010).The S 275−295 was determined both by a logtransform linear regression and an exponential regression.The variation of these two methods was approximately 3 % on average.Linear regression has been frequently used to calculate S 275−295 (Fichot and Benner, 2012;Helms et al., 2008;Yang et al., 2013), and in this study, showed higher R 2 values than exponential regression.Therefore, the results of linear regression were adopted here.Additionally, if the difference in S 275−295 between the linear and exponential methods was higher than 10 %, indicating a high uncertainty for absorption measurement, such data were removed.The absorption Ångström exponent (AAE) is used to describe the wavelength dependence of light absorption for aerosol (Bond, 2001), which has also been applied to characterize the ILAPs and CDOM in snow and ice (Doherty et al., 2010;Niu et al., 2018;Wang et al., 2013;Yan et al., 2016).The AAEs were calculated using power-law regression in the wavelength range of 240 to 550 nm, as follows: where K is a constant related to DOM concentration.The R 2 of all regressions (S 275−295 and AAE) were higher than 0.9 and most of them were higher than 0.95.
Because the light absorption within the visible wavelengths of some samples were below the detection limit of the spectrophotometer, 19 of 39 samples were available for the calculations of AAE.
Note that the left samples of sites 51b and 58, which showed abnormal absorption and fluorescence spectra compared to other samples, were supposed to be contaminated, and thereby these two samples were not considered for the absorption and fluorescence analyses.

Soluble ions
The major soluble ions of meltwater samples were analyzed with an ion chromatograph (Dionex, Sunnyvale, CA, USA) using an AS11 column for the anions SO 2− 4 , NO − 3 , Cl − , and F − and a CS12 column for the cations Na + , K + , Ca 2+ , Mg 2+ , and NH + 4 .The soluble ions showed no obvious differences between filtered and unfiltered samples (Pu et al., 2017).According to Pio et al. (2007), the K + can be separated into three fractions: sea salt (ss), dust, and others (the fraction not related to sea salt and mineral dust, nss-ndust).The nss-ndust-K + is a good marker for biomass burning (Pio et al., 2007).The Ca 2+ concentrations of our samples were mostly higher than that of Na + , leading to much larger mass ratios of Ca 2+ /Na + than that in seawater (0.038) (Pio et al., 2007).Therefore, Ca 2+ is dominated by the dust fraction and not corrected to nss-Ca 2+ in this study.nss-ndust-K + is calculated using the following formulas (Pio et al., 2007): ss-Na + = Na + − 0.14 × Ca 2+ , (8) In Eq. ( 7), 0.038 is the mass ratio of K + /Na + in seawater (Pio et al., 2007).In Eq. ( 8), the lowest mass ratio of Na + /Ca 2+ of our samples (0.14) is used to evaluate the dust fraction of Na + .Similarly, the lowest mass ratio of K + /Ca 2+ (0.028) is used in Eq. ( 9) to calculate the dust fraction of K + .

Hierarchical cluster analysis
A hierarchical cluster analysis was used to classify the samples based on the relative abundances of three PARAFAC components.Euclidean distance was used to estimate the distances between samples.Before determining the clustering method, the cophenetic correlation coefficients, criteria for assessing the efficiency of clustering methods (Saracli et al., 2013), for the cluster trees created by different methods were calculated, including unweighted average, weighted average, centroid, farthest neighbor, shortest neighbor, weighted center of mass, and Ward's methods.Finally, the unweighted average method was chosen due to the highest cophenetic correlation coefficients.A total of four clusters were determined and labeled as clusters A-D.

Air mass backward trajectories and active-fire data
Air mass backward trajectory has been widely used to identify the sources of air pollution (Stein et al., 2015) and also successfully applied to studies of impurities in snow (Hegg et al., 2010;Wang et al., 2015;Zhang et al., 2013) The distributions of a CDOM (280) and S 275−295 are shown in Fig. 3, and the corresponding values are summarized in Table 1. a CDOM (280) ranged widely from 0.15 to 10.57 m −1 with an average of 1.69 ± 1.80 m −1 (mean ± standard deviation).The highest value appeared at site 67 (10.57m −1 ), followed by sites 53, 79, and 47 (5.25, 3.13, and 3.11 m −1 ).Most of these samples were collected from snow drifts.These values were higher than the a CDOM (280) of CDOM in snow, ice, and cryoconite on the Tibetan Plateau (typically lower than 2.0 m −1 ) (Feng et al., 2016(Feng et al., , 2017)).The lowest value was found at site 66 (0.15 m −1 ), followed by sites 70, 82, 73, and 83 (0.21, 0.23, 0.30, and 0.31 m −1 ), and these values were comparable with the absorption of soluble species in Alaskan snow with typical values of 0.1-0.15m −1 at 250 nm (Beine et al., 2011).Some of these samples comprised freshly fallen snow and some were collected at remote sites that were far from pollution sources (Pu et al., 2017).
The values of S 275−295 ranged from 0.0129 to 0.0389 nm −1 with an average of 0.0243±0.0073nm −1 .S 275−295 has never been reported in the terrestrial snow and ice samples before but is widely measured in the aquatic environments.For example, Hansen et al. (2016) summarized the S 275−295 for oceanic and terrestrial water systems.The values are in the range of 0.020-0.030nm −1 for ocean, 0.010-0.020nm −1 for coastal water, and 0.012-0.023nm −1 for rivers and wet-  1, which ranged from 4.41 to 8.91 with an average of 5.55 ± 1.11.This value is comparable with the average AAE of HULIS extracted from Alaskan snow (6.11, from 300 to 550 nm) (Voisin et al., 2012).
The detailed results of each region are discussed below.Region 1 (sites 47-52) is located in the eastern Tibetan Plateau, which is typically higher than 4000 m above sea level.In this region, the snowpack is usually patchy and thin (Fig. 2a).During windy weather, local soil can be blown and deposited on the snow surface, which had been observed by previous studies (Pu et al., 2017;Ye et al., 2012).Moreover, the filters for samples in this region were in yellow due to the high loading of soil dust.The average a CDOM (280) was the highest among all five regions (2.30 ± 0.52 m −1 ), and the S 275−295 fell in the range of 0.0170-0.0212nm −1 (0.0188 ± 0.0015 nm −1 ), which shows similar values of leaching for permafrost on the Tibetan Plateau (Wang et al., 2018).
Region 3 (sites 60, 62, 63, and 80-84) is the most developed part of Xinjiang, and major industrial cities are located here (e.g., Urumqi, Shihezi, Kuytun, and Karamay).Therefore, human activities may dominate the contribution of snow CDOM in this region.However, the a CDOM (280) values were mostly less than 1.0 m −1 except at sites 60 and 84, with a low average of 0.93 ± 0.68 m −1 .Because samples of these sites were almost new-fallen snow, the deposition of pollutants to the snowpack can be quite slight.Sites 60 and 84 were both close to industrial cities (Fig. 1 in Pu et al., 2017), and locally anthropogenic pollutants may be responsible for the high a CDOM (280) (2.39 and 1.65 m −1 ).The average S 275−295 was 0.0218 ± 0.0057 nm −1 in this region.

PARAFAC components
The EEMs of snow samples were analyzed by PARAFAC model, and three fluorescent components (C1-C3) were identified (Fig. 4).The corresponding excitation and emission loading spectra of each component are shown in Fig. S2.The excitation-emission (Ex-Em) wavelengths of each component's fluorescence peaks are summarized in Table 2   C1 showed a primary peak at < 240/453 nm for Ex-Em, which was similar to the component 1 reported by Stedmon and Markager (2005b) (Ex-Em = < 250/448).This kind of fluorophore absorbs light mainly in the UVC band and shows a broad emission peak, which is usually identified as a terrestrial FDOM (Stedmon et al., 2003).The appearance of a secondary peak at longer excitation wavelength (Ex-Em = 305/453 nm) may indicate that C1 is more aromatic and has a higher molecular weight (Coble et al., 1998).C1 also resembled another terrestrial fluorophore, namely component 4 in Stedmon and Markager (2005b) (Ex-Em = < 250(360)/440), which has been widely found in natural freshwater environments and even water-extracted organic matter in aerosols (Chen et al., 2016;Mladenov et al., 2011;Zhang et al., 2009;Zhao et al., 2016).
C2 had a primary (secondary) peak at < 240(300)/393 nm (Ex-Em), which was first measured in the oceanic system by Coble (1996).Subsequently, Stedmon et al. (2003) found a similar fluorophore (component 4 therein) in a terrestrially dominated estuary region.The following studies suggested that the C2-like components are also linked to microbial activity and phytoplankton degradation in natural aquatic systems (Yamashita et al., 2008;Zhang et al., 2009) or DOM in waste water from anthropogenic sources (Stedmon and Markager, 2005b).
C3 is a typical fluorophore that is categorized as tyrosine-like FDOM and that exhibits Ex-Em pairs of < 240(270)/315 nm.C3 reflects autochthonously labile DOM produced by biological processes (Stedmon et al., 2003) and has been commonly reported in previous studies of natural water bodies and water extraction of aerosols (Chen et al., 2016;Murphy et al., 2008;Stedmon and Markager, 2005a).

Regional variation in PARAFAC components
Figure 5 shows the variations of three fluorescent components among regions, including intensities and relative contributions.Overall, C2 was the most intense fluorophore and accounted for 42 % on average of the total fluorescence intensity of all samples, followed by C3 (38 % on average) and C1 (20 % on average).Compared to glacial snow and ice samples, which were dominated by protein-like substances (Dubnick et al., 2010;Feng et al., 2016), the seasonal snow samples in this study showed fewer microbial characteristic.According to Thieme et al. (2016), although we might underestimate %C1 (approximately 3 %) and overestimate %C2 (approximately 6 %) due to the preservation artifacts, it only slightly changes the results shown here.
In Qinghai (region 1), the most obvious feature was that C1 accounted for approximately 35 % of the total fluorescence intensity on average.This value was significantly higher than that of the other regions.In contrast, %C3 was quite low (24 % on average).This result was mainly due to the high F max (C1) in region 1, since the regional variation of F max (C3) was slight (Fig. 5).
In Xinjiang (regions 2-5), %C1 varied by region, while %C2 and %C3 were roughly equal.In region 2, %C1 was also high (25 % on average).However, %C1 showed the lowest value (9 % on average) in region 3, where most of the samples were new-fallen snow (7 of 8 sites).The great difference between %C1 and %C2 in this region indicated different sources of these two humic-like components.In regions 4 and 5, %C1 were nearly double of that in region 3 (both were approximately 17 % on average).
At sites 54 and 82, the relative abundance of C3 exceeded 70 %, which was approximately two-fold higher than the average of the whole data set (38 %).This result can be explained in two ways: (1) lower inputs of C1 and C2, and (2) greater biological activities being available in the snowpack at these sites.We found lichens near these two sites (Fig. S3), providing evidence for the latter.
At site 67, the fluorescence intensities were highest among all samples (0.30, 0.39, and 0.38 RU for C1, C2, and C3), especially for C3.The average F max (C3) was 0.10 RU for all samples excluding site 67, with a low standard deviation of 0.02 RU, and this value was approximately one-fourth of that at site 67.Therefore, rather than owing to biological activity alone, the extremely high F max (C3) of site 67 may be due to other sources, for instance, some organic compounds released from diesel combustion may show similar spectra (Mladenov et al., 2011).
To assess the similarities and differences between samples, a hierarchical cluster analysis based on the relative intensities of fluorescent components was conducted (Fig. 6).The snow samples were separated into four clusters (clusters A- D) (Fig. S4).Samples classified into clusters A and B were dominant.The high %C1, which was 34 % on average, was the most remarkable feature of cluster A and led to a low %C3 (26 % on average).All samples in region 1 and most samples in region 2 were assigned to cluster A. For cluster B, %C1 was low (13 % on average), and %C3 (47 % on average) was slightly higher than %C2 (40 % on average).For the sites in northern Xinjiang (regions 4 and 5), most samples were classified into cluster B. The samples assigned to cluster C, including those of sites 60, 62, 69, 72, 76, and 84, showed the dominant contribution of C2 (57 % on average).
www.the-cryosphere.net/13/157/2019/The Cryosphere, 13, 157-175, 2019 Half of these samples were found in region 3, and the others were dispersed in regions 4 and 5. Cluster D contained only two samples from sites 54 and 82.The difference between cluster D and the others was an extremely high contribution of protein-like component C3 (73 % on average), which indicated the high bioavailability of snow CDOM.

Fluorescence-derived indices
The regional variations of three established fluorescencederived indices are shown in Fig. 7 and the values of each site are listed in Table 1.The HIX values fell into the range of 0.16-3.20,with an average of 1.21 ± 0.78.The highest HIX appeared in region 1 (2.21±0.42),demonstrating a high degree of humification of snow CDOM.The lowest value was found in region 3 (0.62 ± 0.37), which suggests that the CDOM was fresh.This finding is easily explained by the fact that nearly all snow samples in this region were new-fallen snow.Compared to other types of samples (Table 3), the HIX of snow CDOM across northwestern China was higher than that of spring water (Birdwell and Engel, 2010); comparable to those of cryoconite in glaciers from the Tibetan Plateau (Feng et al., 2016), inland lakes (Zhang et al., 2010), and North Pacific Ocean water (Helms et al., 2013); lower than those of cave water (Birdwell and Engel, 2010), estuarine water (Huguet et al., 2009), fog water (Birdwell and Valsaraj, 2010), groundwater (Huang et al., 2015), water extraction of alpine aerosol (Xie et al., 2016), and urban aerosol (Mladenov et al., 2011).
According to McKnight et al. (2001) and Huguet et al. (2009), the values of FI > 1.9 or BIX > 1.0 indicate microbially derived DOM.The BIX and FI for the snow samples were typically below 1.0 and 1.9, implying an unremarkably autochthonous characteristic.The regional distributions of BIX and FI corresponded with that of HIX.The samples with highest average BIX and FI were in region 3 (0.93±0.25 and 1.60 ± 0.15), and the samples in region 1 exhibited the lowest average values (0.49±0.05 and 1.29±0.05).The BIX and FI of different types of samples changed little, while the only exception was the FI of cryoconite in glaciers from the Tibetan Plateau (Feng et al., 2016), which was approximately twice as high as those of the other samples.

Source identification of PARAFAC components
As mentioned in Sect.3.1, the snowpack in Qinghai was strongly influenced by local soil dust, which was confirmed by the lowest S 275−295 , leading to a high %C1.This result implied that the terrestrial fluorophore C1 was mainly from the soil HULIS, and demonstrated the invariably terrestrial source of the C1-like fluorophores, regardless of whether in the natural water bodies, aerosol water extraction, or snow.
Correlation analyses were conducted to assess the potential sources of C2.The mutual relationships between PARAFAC components were shown in Fig. 8.The F max (C3) of site 67 was much higher than that of any other sample (shown as red markers in Fig. 8), which can strongly influence the results of the correlation analysis.When excluding the data of site 67, the R 2 between F max (C1) and F max (C3) fell from 0.316 to 0.082, and the linear relationship became nonsignificant (Fig. 8b).Therefore, we used the data set that excludes site 67 in the analysis, and the results are shown below.F max (C1) and F max (C2) were linearly correlated with each other (R 2 = 0.332, p < 0.001); however, the R 2 value was much lower than those in previous studies of natural water, for instance R 2 = 0.63 for inland lakes (Zhao et al., 2016) and R 2 = 0.88 for inland rivers (Zhang et al., 2011).This result indicated that soil dust only partly accounted for the source of C2.Meanwhile, a significant linear relationship (R 2 = 0.364, p < 0.001) was found between F max (C2) and F max (C3), which implied a potential microbial source for C2 and was consistent with the finding of Yamashita et al. (2008).Not surprisingly, F max (C1) and F max (C3) showed no correlation (R 2 = 0.082, p > 0.05).Furthermore, the correlation coefficients of F max and three major ions were calculated.The results are shown in Table 4. F max (C2) showed significant and positive correlations with these ions (p < 0.001).The secondary ions SO 2− 4 and NO − 3 are commonly considered to be the markers of anthropogenic emissions from the burning of fossil fuel, such as oil and coal (Doherty et al., 2014;Oh et al., 2011;Pu et al., 2017), and nss−ndust−K + is a good tracer of biomass burning (Pio et al., 2007).Therefore, C2 may also originate from anthropogenic pollution and biomass burning.Overall, there were four potential sources of snow CDOM in our study.Since the contribution of microbial-derived C3 to a CDOM (280) was relatively low compared to C1 and C2 (Fig. S5), three major sources were identified, i.e., soil dust, biomass burning, and anthropogenic pollution.
The ratios of intensities for PARAFAC components can be a useful tool for tracing the CDOM sources (Murphy et al., 2008).In this study, the ratio of F max (C2) and F max (C1) was applied to assess the relative contributions of soil and nonsoil (i.e., biomass burning and anthropogenic pollution) sources for snow CDOM (Fig. 9a).An analysis of variations (ANOVA) was used to test the differences among regions.Regions 1 and 2 showed low ratios of F max (C2) and F max (C1) (1.20±0.14 and 1.76±0.82),indicating the strong influence from local soil dust.The values of F max (C2)/F max (C1) for regions 3, 4, and 5 were significantly higher (ANOVA, p < 0.05) with averages of 5.57 ± 2.26, 3.17 ± 1.47, and 3.02 ± 1.22.This result implied that the sources of snow CDOM in these regions were different from those in regions 1 and 2, and were mainly from nonsoil sources.

Regional variations
The regional variations of CDOM sources are discussed below using analyses of absorption and fluorescence characteristics, chemical species, and air mass backward trajectories.
In addition, the sources of CDOM in snow are also compared with those of particulate light absorption of ILAPs.
In Qinghai (region 1), the lowest regional average and slight variation in S 275−295 indicated the dominant contribution of terrestrial sources for snow CDOM (e.g., local soil dust) (Fichot and Benner, 2012;Helms et al., 2008).This result was also verified by the fluorescence properties: the highest HIX and %C1 and the lowest F max (C2)/F max (C1).
Figure 10.72 h air mass backward trajectories at 500 m a.g.l. with the initial positions at representative sites (shown as yellow pentagrams) in each region.Trajectories were calculated 4 times per day for a period of 30 days preceding the sampling date at a given site by HYSPLIT (version 4, NOAA) except for panel (c).Since the snow was fresh at site 84, the trajectories were derived for 5 days preceding the sampling date.The red lines show the air masses that passed through the active fires before reaching the receptor sites, and the blue lines are those did not pass the fires.The white dots represent the typical industrial cities in Xinjiang, i.e., Karamay, Kuytun, Shihezi, and Urumqi from west to east.

Comparing the light absorption by CDOM and BC
Figure 11 shows the relative contributions of CDOM and BC to light absorption.As mentioned above, light absorption within visible wavelengths was available for 19 samples.The BC concentrations in surface snow were obtained from Pu et al. (2017), and the MAC and AAE of BC used in the calculation were 6.3 m 2 g −1 (550 nm) and 1.1 (Pu et al., 2017).
Most of these sites were assigned to cluster A, except sites 60, 69, and 84.As discussed in Sect.3.2.2,sites of cluster A exhibited high values of %C1, indicating that CDOM mainly originated from soil dust.At sites 50, 52, and 79, the light absorptions of CDOM and BC were roughly equal at 400 nm.It was not only due to the high abundances of CDOM but www.the-cryosphere.net/13/157/2019/The Cryosphere, 13, 157-175, 2019 also the relatively low BC mixing ratios in snow (approximately 30 ng g −1 , Pu et al., 2017).Sites 60, 69, and 84, where the fluorescence intensities were dominated by C2, were the only three sites assigned to cluster C. Biomass burning and anthropogenic pollution (e.g., fossil fuel combustion) are both major sources of fluorophore C2 and BC.Therefore, the BC mixing ratios were approximately 300 ng g −1 at these sites (Pu et al., 2017), leading to quite low ratios of light absorption due to CDOM and BC (approximately 0.03 at 400 nm).At other sites, this value was typically in the range of 0.1 to 0.4.In summary, the light absorption of CDOM was 0.34 ± 0.34 times that for BC at 400 nm.At 500 nm, this value decreased quickly to 0.10 ± 0.11 due to the stronger wavelength dependence of CDOM absorption.This finding is quite different from the results for snow samples collected at Barrow, Alaska.As presented by Doherty et al. (2013), the mixing ratio of BC in Barrow snow ranged between 10 and 30 ng g −1 ; however, the equivalent BC mixing ratio of CDOM absorption was only 0.14 ng g −1 at 400 nm and 0.07 ng g −1 at 550 nm (Dang and Hegg, 2014).Hence, the absorption of CDOM in Alaskan snow can be safely ignored, but this does not appear reasonable for some areas across northwestern China.
Previous studies have focused on the insoluble particles (e.g., BC, ISOC, and MD) in seasonal snow (Doherty et al., 2010(Doherty et al., , 2014;;Pu et al., 2017;Wang et al., 2013).The above discussion indicates that in some specific areas of northwestern China, the absorption of CDOM in snow was remark-able.In addition to the results of cluster analysis, we summarized several absorption-and fluorescence-related indices of these sites.The average S 275−295 (0.0187 ± 0.0022) of these 19 sites was lowest compared to those of regions 1-5.The averages of BIX (0.60 ± 0.20), FI (1.31 ± 0.09), and F max (C2)/F max (C1) (1.66 ± 1.03) were lower than those of region 2, in which the influence of local soil dust was obvious.Besides, the averages of HIX (1.87 ± 0.57) and %C1 (30 %) were higher than those of region 2.These results confirmed that the CDOM of these sites was undoubtedly from terrestrial origins (e.g., wind-blown soil dust).Hence, we suggest that the absorption by CDOM in the snowpack, which is heavily affected by soil, cannot be ignored.

Conclusions
Seasonal snow samples were collected across northwestern China from January to February 2012.The a CDOM (280) and S 275−295 of snow CDOM ranged from 0.15 to 10.57 m −1 and 0.0129 to 0.0389 nm −1 .The average value of a CDOM (280) (1.69 ± 1.80 m −1 ) was approximately 10 times higher than that in Alaska (Beine et al., 2011).Samples in Qinghai (region 1) exhibited the highest average a CDOM (280) (2.30±0.52 m −1 ) and the lowest average S 275−295 (0.0188± 0.0015 nm −1 ), resulting from the strong influence of local soil dust.Lower average a CDOM (280) appeared in central Xinjiang (region 3, 0.93 ± 0.68 m −1 ), where almost all the samples were collected from new-fallen snow, and northwestern Xinjiang (region 4, 0.80 ± 0.62 m −1 when excluding site 67), which was far from industrial areas.In the Tianshan Mountains (region 2) and northeastern Xinjiang (region 5), the average values of a CDOM (280) were 2.00 ± 1.50 m −1 and 1.17 ± 0.63 m −1 .For all sites in Qinghai and some sites in Xinjiang (19 of 39 sites), the light absorption of CDOM cannot be neglected and was even remarkable (0.34 ± 0.34 times relative to BC at 400 nm) due to the high contribution of CDOM from soil dust.Hence, we suggest that the CDOM absorption in visible wavelengths at such sites should be taken into consideration in future studies.
Based on PARAFAC analysis, two humic-like fluorophores (C1 and C2) and one protein-like fluorophore (C3) were identified.In Qinghai, %C1 (35 % on average) was much higher than those of the other regions; besides, the highest HIX, the lowest BIX and FI were also found.In Xinjiang (regions 2-5), %C1 varied among the regions.In region 2, C1 accounted for approximately 25 % to the total fluorescence, followed by regions 4 and 5 (both 17 % on average).In region 3, C1 contribution was the lowest (9 % on average), and the values of fluorescence-derived indices also showed consistent results (the lowest HIX, the highest BIX and FI).A hierarchical cluster analysis was used to classify samples into four clusters (A-D) based on the relative intensities of three fluorescent components.All samples in region 1 and most samples in region 2 were assigned to cluster A (a high contribution of C1).The number of samples assigned to cluster B (roughly equal contributions of C2 and C3) and cluster C (a dominant contribution of C2) were nearly even in region 3.For regions 4 and 5, most samples were classified into cluster B. Only two samples were assigned to cluster D due to the dominant contribution of C3.
According to the correlation analysis between F max (C2) and three major ions (SO 2− 4 , NO − 3 , and nss − ndust−K + ), as well as the mutual relationships among three fluorescent components, C2 exhibited potential sources of soil dust, microbial activity, anthropogenic pollution, and biomass burning.Furthermore, the regional distribution of CDOM sources was assessed by using variations of F max (C2)/F max (C1), (SO 2− 4 + NO − 3 )/nss − ndust − K + , Cl − /Na + , and air mass backward trajectory analysis.The major sources were soil dust for regions 1-2, anthropogenic pollution for region 3, and biomass burning for regions 4-5.
This study investigated the optical characteristics and potential sources of CDOM in seasonal snow across northwestern China.Future studies should focus on the molecular characteristics of snow CDOM and the relationship with optical properties, which is of great importance to the energy budget of snowpack and the global carbon cycle.
Data availability.All data sets and codes used to produce this study can be obtained by contacting Xin Wang (wxin@lzu.edu.cn).The elevation data used in this study are available at: https://rda.ucar.edu/datasets/ds759.3/description (last access: 10 January 2019, National Geophysical Data Center/NESDIS/NOAA/U.S.Department of Commerce, 2001).
Author contributions.YZ drew the figures and wrote the manuscript.HW and YZ analyzed the data of light absorption, fluorescence, and ions, and also performed the backward trajectory analysis.JL, WP, and XW conducted the experiments.QC conducted the PARAFAC analysis.XW and YZ designed the experiments.All authors discussed and edited the manuscript.
Competing interests.The authors declare that they have no conflict of interest.

Figure 1 .
Figure 1.(a) Location of study area and sample site distribution across northwestern China.The site numbers and regional groupings are shown in panel (b) for Xinjiang and (c) for Qinghai.Sample sites are divided into five groups indicated by different symbols, and the land cover types are represented in different colors, as shown in the legend in panel (a).The D indicates that the sample was collected from a snow drift, and the F indicates that the sample was fresh snow.The elevation is shown in the contour plot.

Figure 2 .
Figure 2. Pictures of typical sample sites.

Figure 3 .
Figure 3. a CDOM (280) and S 275−295 for sites in (a, c) Xinjiang and (b, d) Qinghai.The five regions are indicated by different symbols (same as Fig. 1).

Figure 5 .
Figure 5. Variations of fluorescent components among regions.The box plots show the F max of different components.The boxes denote the 25th and 75th quantiles, and the horizontal lines represent the 50th quantiles (medians), the averages are shown as dots; the whiskers denote the maximum and minimum data within 1.5 times the interquartile range, and the data points out of this range are marked with crosses (+).The pie charts show the regionally average relative contributions of three components.C1, C2, and C3 are represented in red, yellow, and blue, both for the box plots and pie charts.The percentages on the left of the panel are the averages of %C1-%C3 for the whole data set.

Figure 6 .
Figure 6.Hierarchical cluster analysis based on the relative contributions of fluorescent components.

Figure 7 .
Figure 7. Variations of HIX (red), BIX (blue), and FI (green) among regions.The meaning of each part of the box is same as that in Fig. 5.

Figure 11 .
Figure 11.Relative absorption contributions of CDOM and BC at (a) 400 nm and (b) 500 nm.

Table 1 .
Statistics on absorption and fluorescence parameters for snow CDOM at each site.Note that N.A. stands for no data.

Table 2 .
Descriptions of fluorescent components identified by PARAFAC analysis.The secondary peaks are shown in parentheses.

Table 3 .
The fluorescence-derived indices in this study and comparison with those of natural water bodies and water extractions of aerosols reported by other studies.Note that average values are shown in parentheses.

Table 4 .
Pearson's correlation coefficients (r) of major ions and F max for fluorescent components when excluding the data from site 67; the results for the entire data set are shown in parentheses.Note that * denotes p < 0.001.