<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">TC</journal-id><journal-title-group>
    <journal-title>The Cryosphere</journal-title>
    <abbrev-journal-title abbrev-type="publisher">TC</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">The Cryosphere</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1994-0424</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/tc-20-3313-2026</article-id><title-group><article-title>Learning to melt: Emulating Greenland surface melt from a polar RCM with machine learning</article-title><alt-title>Melt emulation</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1 aff2">
          <name><surname>Schlager</surname><given-names>Elke</given-names></name>
          <email>eschlager@envs.au.dk</email>
        <ext-link>https://orcid.org/0009-0004-8880-6283</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Scher</surname><given-names>Sebastian</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-6314-8833</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Mottram</surname><given-names>Ruth H.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-1016-1997</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Langen</surname><given-names>Peter L.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0003-2185-012X</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Department of Environmental Science, iClimate, Aarhus University, Roskilde, Denmark</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>National Centre for Climate Research (NCKF), Danish Meteorological Institute, Copenhagen, Denmark</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Wegener Center for Climate and Global Change and Department of Geography and Regional Science, University of Graz, Graz, Austria</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Elke Schlager (eschlager@envs.au.dk)</corresp></author-notes><pub-date><day>3</day><month>June</month><year>2026</year></pub-date>
      
      <volume>20</volume>
      <issue>6</issue>
      <fpage>3313</fpage><lpage>3343</lpage>
      <history>
        <date date-type="received"><day>1</day><month>January</month><year>2026</year></date>
           <date date-type="rev-request"><day>27</day><month>January</month><year>2026</year></date>
           <date date-type="rev-recd"><day>17</day><month>April</month><year>2026</year></date>
           <date date-type="accepted"><day>18</day><month>May</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Elke Schlager et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026.html">This article is available from https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026.html</self-uri><self-uri xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026.pdf">The full text article is available as a PDF file from https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e125">Predicting surface melt on the Greenland ice sheet is critical for understanding surface mass balance (SMB) and its sensitivity to a changing climate. Polar regional climate models (RCMs) are the primary tools for simulating melt and projecting future SMB, but different models produce significantly different results. However, they are too computationally expensive to create the large ensembles needed to quantify this uncertainty. We develop a neural network based emulator that predicts daily surface melt from atmospheric variables, trained on output from the polar RCM HIRHAM5 and its firn model DMIHH forced by ERA-Interim. The emulator uses a physics-informed design combining short-term weather with long-term climate memory, capturing both immediate atmospheric forcing and accumulated firn characteristics. Input selection study shows that turbulent heat fluxes, downwelling radiation, and precipitation together with seasonal encoding suffice to reproduce surface melt. The emulator achieves mean absolute error below 0.21 mm w.e. per day relative to the surface melt produced by DMIHH across all six Greenland drainage basins, with the errors primarily attributable to spatial over-smoothing. Our work demonstrates that machine learning can successfully emulate firn model behavior from climate forcing alone with computational costs orders of magnitude lower than traditional simulations. Once retrained for specific climate forcings, the emulator thus enables extensive ensemble projections. Furthermore, the modular architecture can be readily adapted to emulate other SMB quantities such as runoff. This represents a crucial first step toward computationally efficient emulation of polar regional climate models and surrogate modeling of SMB components in Earth system modeling.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>Novo Nordisk</funding-source>
<award-id>NNF23OC0081251</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e137">The Greenland ice sheet (GrIS) is losing mass today, and it will continue to do so in the future. Runoff caused by surface melt is a key component of surface mass balance (SMB), with increased melt leading to a negative SMB and subsequently a decrease in mass.  With ongoing atmospheric warming and the positive feedback of surface melt with albedo and the lowering of elevation, surface melt will increase further in the future  <xref ref-type="bibr" rid="bib1.bibx31" id="paren.1"/>. While future projections agree that melt will increase, the predictions are inconsistent about the rate of this melt increase and associated SMB loss <xref ref-type="bibr" rid="bib1.bibx16" id="paren.2"/>.</p>
      <p id="d2e146">Polar regional climate models (RCMs) combined with a firn model show the best agreement with observational data among different modeling approaches for predicting SMB <xref ref-type="bibr" rid="bib1.bibx14" id="paren.3"/>. But they are also the most complex and computationally intensive approach, as they require running two numerical models: (a) the RCM to downscale atmospheric data from a forcing global climate model (GCM), and (b) the firn model to infer surface mass balance based on the surface energy balance (SEB) and firn properties that evolve from local atmospheric conditions. Both models are based on physical processes, modeled with computationally expensive numerical schemes. Despite their common framework, polar RCMs such as HIRHAM <xref ref-type="bibr" rid="bib1.bibx34 bib1.bibx26" id="paren.4"/>, MAR <xref ref-type="bibr" rid="bib1.bibx12 bib1.bibx13" id="paren.5"/>, and RACMO <xref ref-type="bibr" rid="bib1.bibx35" id="paren.6"/> differ considerably in their assumptions, physical process representations, parameterizations, and numerical schemes, producing notable discrepancies in their SMB estimates <xref ref-type="bibr" rid="bib1.bibx14" id="paren.7"/>. These discrepancies increase even more in future scenarios, as the models exhibit different sensitivities to atmospheric warming, leading to a varying increase in melt water production, and in turn to a positive feedback amplifying the models' discrepancies even more <xref ref-type="bibr" rid="bib1.bibx16" id="paren.8"/>. The use of a diverse ensemble of simulations is therefore crucial to mitigate model-specific biases and to improve robustness and reliability of projections, as well as to evaluate the projections statistically <xref ref-type="bibr" rid="bib1.bibx16 bib1.bibx30" id="paren.9"/>. However, the high computational costs of  polar RCMs limit the generation of large ensembles.</p>
      <p id="d2e171">In recent years, Machine Learning (ML) has shown great prospects for simulating various parts of the Earth system <xref ref-type="bibr" rid="bib1.bibx37 bib1.bibx8 bib1.bibx38" id="paren.10"/>. ML-based emulators of climate model output are data-driven approximations of complex physical models that can produce predictions at a fraction of the computational cost of the respective physical models, allowing for the production of large ensembles <xref ref-type="bibr" rid="bib1.bibx46" id="paren.11"/>. More specifically, an emulator of a polar RCM could be used to extend existing simulations over longer time periods, to complement projections under various Shared Socio-economic Pathway (SSP) scenarios which were not covered by the original simulations, or to produce simulations under new climate forcings. Additionally, emulators can be used not only for creating more simulations, but also for rapid hypothesis testing and sensitivity analysis. Moreover, besides emulating Earth system components standalone, ML emulators can also be used for surrogate modeling within numerical Earth system models.</p>
      <p id="d2e180">ML applications for firn and SMB modeling include approximating observational data using reanalysis climate data <xref ref-type="bibr" rid="bib1.bibx52 bib1.bibx36 bib1.bibx4 bib1.bibx2" id="paren.12"/>, improving model estimates by observational data <xref ref-type="bibr" rid="bib1.bibx22 bib1.bibx9" id="paren.13"/>, or emulating firn model outputs <xref ref-type="bibr" rid="bib1.bibx44 bib1.bibx54 bib1.bibx50 bib1.bibx11" id="paren.14"/>. These studies predominantly employ tree-based methods like XGBoost, which builds an ensemble of decision trees where each successive tree corrects errors from previous ones <xref ref-type="bibr" rid="bib1.bibx6" id="paren.15"/>, or neural networks (NNs), which stack nonlinear transformations to approximate complex functions <xref ref-type="bibr" rid="bib1.bibx18" id="paren.16"/>. While XGBoost offers simpler deployment, NNs can achieve superior performance given sufficient training data and careful tuning <xref ref-type="bibr" rid="bib1.bibx49 bib1.bibx57" id="paren.17"/>.</p>
      <p id="d2e203">Furthermore, ML techniques for image super-resolution methods that reconstruct high-resolution images from low-resolution counterparts have been adapted for downscaling meteorological variables <xref ref-type="bibr" rid="bib1.bibx45 bib1.bibx20" id="paren.18"/>. <xref ref-type="bibr" rid="bib1.bibx9" id="text.19"/> uses such a ML downscaling approach to downscale SMB to a higher spatial resolution, while <xref ref-type="bibr" rid="bib1.bibx50" id="text.20"/> infers SMB directly from coarser-resolution climate fields at a coarser resolution, effectively emulating both the RCM and the firn model, simultaneously. While this approach delivers SMB estimates directly from GCM data without intermediate climate downscaling, separating these processes offers key advantages. Downscaling is inherently spatial, requiring models to capture atmospheric dynamics and topographic effects, whereas firn models operate on one-dimensional vertical columns. A standalone firn emulator trained on local atmospheric forcing learns location-agnostic relationships that generalize across different locations and climate states, rather than encoding spatial patterns specific to the training domain. This design enhances robustness, since unusual melt events triggered by specific atmospheric conditions can be predicted even at atypical locations if similar conditions occurred elsewhere in the training data.</p>
      <p id="d2e215">Existing works operate at annual or monthly timescales and use only the current timestamp data for regression, with only <xref ref-type="bibr" rid="bib1.bibx54" id="text.21"/> and <xref ref-type="bibr" rid="bib1.bibx52" id="text.22"/> incorporating multi-scale temporal aggregation to capture slow-evolving firn properties. However, the temporal history required for daily melt predictions remains unexplored. Firn properties integrate conditions over years to decades, whereas surface melt responds to daily atmospheric forcing but also evolves gradually, suggesting temporal history is necessary. Additionally, most approaches rely on air temperature as a melt proxy rather than direct physical drivers of the surface energy balance, i.e., turbulent heat fluxes and radiation, even though these variables are available in reanalysis products and polar RCMs.</p>
      <p id="d2e224">In this study, we address these gaps by developing a NN for emulating GrIS surface melt, based on model output from the polar RCM HIRHAM5 (forced by ERA-Interim). We thereby focus on two key aspects that distinguish our approach from existing work: First, we operate at daily resolution, introducing new challenges due to the high temporal variability of surface melt compared to monthly or annually aggregated data. To address this, we design our NN in a physically informed way with separate modules, extracting short- and long-term information from daily and 10 year aggregated data separately. We systematically test the impact of including short-term history and long-term information, as well as configurations including albedo or the previous day's melt. Second, we expand the range of input variables compared to previous studies. Contrary to most existing work which rely primarily on air temperature and precipitation data, we also include the direct drivers of the SEB. We then conduct a systematic input selection analysis to assess the impact of including atmospheric variables beyond the direct SEB drivers, as well as seasonal information. We find that snowfall, rainfall, and seasonality are important predictors in addition to the direct SEB terms, while including air temperature does not increase model performance any further.</p>
      <p id="d2e227">This work is a first step toward comprehensive emulation of SMB processes in polar RCMs, with surface melt being a key component of GrIS SMB. While our location-agnostic architecture is applicable to different polar RCMs and future scenarios, application requires validation under data distribution shift and potential retraining on the respective RCM outputs, which we identify as important next steps. We demonstrate that our NN accurately reproduces daily melt patterns across the whole GrIS, while requiring only approximately one minute of computation per year on a single CPU, demonstrating that emulators can match physical model accuracy at a fraction of the computational cost.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Materials and methods</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Data</title>
      <p id="d2e245">For the creation of our emulator we use daily output of the polar RCM HIRHAM5 with its firn model DMIHH, forced by ERA-Interim for the period 1980–2016 taken from <xref ref-type="bibr" rid="bib1.bibx26" id="text.23"/>. We first describe the HIRHAM5-DMIHH simulation data to understand its internal consistency and the physical relationships it builds upon, which motivates the design of our NN and the systematic input selection study performed in this work. Thereafter, we describe the data cleaning and processing for training the NN.</p>
<sec id="Ch1.S2.SS1.SSS1">
  <label>2.1.1</label><title>HIRHAM5-DMIHH simulation data</title>
      <p id="d2e258">In <xref ref-type="bibr" rid="bib1.bibx26" id="text.24"/>, HIRHAM5 is run at a spatial resolution of <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.05</mml:mn><mml:mi mathvariant="italic">°</mml:mi><mml:mo>×</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula>, with a snow layer of <inline-formula><mml:math id="M2" display="inline"><mml:mrow class="unit"><mml:mn mathvariant="normal">10</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">m</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">w</mml:mi><mml:mo>.</mml:mo><mml:mi mathvariant="normal">e</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:math></inline-formula> and an internal albedo parameterization dependent on surface temperature, to determine the energy and moisture flux interactions at and below the surface <xref ref-type="bibr" rid="bib1.bibx28" id="paren.25"/>. The resulting atmospheric fields are then used to force the firn model DMIHH offline.</p>
      <p id="d2e303">DMIHH is a one dimensional model organized in 32 layers totaling <inline-formula><mml:math id="M3" display="inline"><mml:mrow class="unit"><mml:mn mathvariant="normal">60</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">m</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">w</mml:mi><mml:mo>.</mml:mo><mml:mi mathvariant="normal">e</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:math></inline-formula> of firn. HIRHAM5 mass fluxes (snowfall, rainfall, deposition, sublimation), downwelling shortwave and longwave radiation (SW<sup>↓</sup>, LW<sup>↓</sup>), and latent and sensible heat fluxes (LHF, SHF) update DMIHH hourly at the surface. The surface state is determined via the SEB, with surface temperature being bound above by <inline-formula><mml:math id="M6" display="inline"><mml:mrow class="unit"><mml:mn mathvariant="normal">0</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">°</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:mrow></mml:math></inline-formula>, and any excess energy producing surface melt:

              <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M7" display="block"><mml:mrow><mml:mi mathvariant="normal">SEB</mml:mi><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>)</mml:mo><mml:msup><mml:mi mathvariant="normal">SW</mml:mi><mml:mo>↓</mml:mo></mml:msup><mml:mo>+</mml:mo><mml:msup><mml:mi mathvariant="normal">LW</mml:mi><mml:mo>↓</mml:mo></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="normal">LW</mml:mi><mml:mo>↑</mml:mo></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="normal">LHF</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="normal">SHF</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="normal">GHF</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

            where <inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:mi>G</mml:mi><mml:mi>H</mml:mi><mml:mi>F</mml:mi></mml:mrow></mml:math></inline-formula> is the ground heat flux from the layers below, and LW<sup>↑</sup> the upwelling longwave radiation depending on the surface temperature. The surface albedo <inline-formula><mml:math id="M10" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> is calculated internally based on surface temperature and snow depth, with <inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.85</mml:mn></mml:mrow></mml:math></inline-formula> for cold snow, decreasing to <inline-formula><mml:math id="M12" display="inline"><mml:mn mathvariant="normal">0.65</mml:mn></mml:math></inline-formula> as the surface warms toward the melting point, and dropping further as surface snow diminishes, with <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula> at lowest for bare ice exposure (i.e., zero snow fraction in the surface layer). While snowfall and rainfall are not directly part of the SEB, they have an indirect influence via the albedo: rainfall is simulated with a temperature of <inline-formula><mml:math id="M14" display="inline"><mml:mrow class="unit"><mml:mn mathvariant="normal">0</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">°</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:mrow></mml:math></inline-formula> and thus warm the surface if its below the melting point; snowfall increases albedo if snow depth is low. The firn model runs offline, which means that atmospheric data applies forcing to the surface without feedback from the surface back to the atmosphere. Consequently, the latent and sensible heat fluxes are prescribed by the RCM without adjustment for actual surface characteristics.</p>
      <p id="d2e485">The firn pack in the DMIHH simulation was spun up by repeatedly cycling 1980–1989 until decadal means of runoff and subsurface temperatures reached a steady state, then run continuously for 1980–2016, with daily aggregated SMB outputs and their associated driving HIRHAM5 outputs saved for the full period. This temporal aggregation potentially constrains melt emulation accuracy by smoothing sub-daily variability and short-lived extremes that control timing and peak rates of melt. However, daily resolution enables broader utility in future applications, aligning with the typical daily temporal resolution of both GCM and RCM output, as well as current ML downscaling approaches.</p>
</sec>
<sec id="Ch1.S2.SS1.SSS2">
  <label>2.1.2</label><title>Data cleaning</title>
      <p id="d2e496">Although data from physical simulations are generally consistent, they may still contain extreme values and outliers caused by numerical instabilities, although these are very rare. While these individual outliers do not affect the overall assessment of modeled melt or other properties, they can disproportionately influence gradient-based optimization, distort the loss function, and impair stable convergence during ML model training. In order to decide how to treat these extreme values and outliers we must consider their source, their impact, and relation to other variables.</p>
      <p id="d2e499">Aggregating rainfall and snowfall to daily values during postprocessing, some negative rainfall values arise as numerical artifacts. We set these values to zero for consistency. Furthermore, rainfall and snowfall show some suspiciously high values of up to about 700 and 1000 mm w.e. per day, respectively. Although they are likely caused by numerical instabilities in high relief topography, we do not correct these values to preserve consistency between the precipitation and the firn model output data used as target. However, we transform both rainfall and snowfall data by applying the logarithm (after adding 1, ensuring the transformation is well-defined for 0 values), which compresses these very high values. While this transformation neither adds nor removes information, it improves the data's usability for ML model training by preventing small values from vanishing in numerical noise relative to large outliers.</p>
      <p id="d2e502">In rare instances, numerical instabilities also produce events of surface temperature runaway, leading to unrealistic surface temperatures approaching <inline-formula><mml:math id="M15" display="inline"><mml:mrow class="unit"><mml:mn mathvariant="normal">0</mml:mn><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">K</mml:mi></mml:mrow></mml:math></inline-formula> due to excessive surface cooling from sensible heat flux as low as <inline-formula><mml:math id="M16" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>400 <inline-formula><mml:math id="M17" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">W</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. We correct these heat flux values to a lower bound of <inline-formula><mml:math id="M18" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>140 <inline-formula><mml:math id="M19" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">W</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, which was the lowest sensible heat flux observed in simulations unaffected by the runaway. The ranges for sensible and latent heat flux span several hundred watts per square meter, yet the majority of the data are concentrated in a very narrow range. To expand the narrow and compress the large ranges, we transform heat flux values <inline-formula><mml:math id="M20" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> with a symmetric logarithm transformation <inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:mtext>symlog</mml:mtext><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>:=</mml:mo><mml:mtext>sgn</mml:mtext><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mo>|</mml:mo><mml:mi>x</mml:mi><mml:mo>|</mml:mo><mml:mo>/</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> according to <xref ref-type="bibr" rid="bib1.bibx56" id="text.26"/>, with <inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> for LHF and <inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula> for SHF, corresponding to approximately half of their respective inter-quartile ranges.</p>
</sec>
<sec id="Ch1.S2.SS1.SSS3">
  <label>2.1.3</label><title>Data preparation for training</title>
      <p id="d2e654">For training a ML model we need a data set to train the model (training set), a dataset to monitor the progress during training and guide decisions during model development (validation set), and an until then unseen set for the final model evaluation (test set). When splitting data, internal dependence structures (e.g., temporal or spatial autocorrelation, data grouping/clustering) together with the modeling task need to be considered to avoid information leakage into the training set <xref ref-type="bibr" rid="bib1.bibx3 bib1.bibx40" id="paren.27"/>. For temporally structured data, this necessitates splitting along the time dimension in contiguous blocks (e.g., entire years) rather than individual samples (e.g., single days) to preserve temporal independence and ensure the validation and test sets remain truly distinct from the training set.</p>
      <p id="d2e660">We split the data spanning from 1980 to 2016 into the three separate periods: a training period (1990–2013), a validation period (2014), and a test period (2016). The first 10 years of data (1980–1989) are included indirectly in the training set as decadal averages for the long-term module. To prevent information leakage between training and final evaluation, we introduced a one year gap between the validation and the test period, which is a common strategy for evaluating models using data with a structural dependency <xref ref-type="bibr" rid="bib1.bibx19 bib1.bibx40" id="paren.28"/>. Although the time windows for calculating the decadal mean conditions still overlap, this overlap is considered to have negligible impact on the validity of the tests because this study focuses on a relatively short period lacking significant trends. Therefore, the decadal means primarily capture location specific characteristics rather than temporal development.</p>
      <p id="d2e666">Our data split prioritizes maximizing the training set to expose the model to a broad range of atmospheric conditions and interannual variability. Consequently, we allocated only single years for validation (2014) and test (2016) periods. While the representativeness of the test set is important to report unbiased final model performance, the representativeness of the validation set is important to make fair decision during model development. Expanding either set to two or three years would reduce training data without guaranteeing improved representativeness. To mitigate potential bias from using single-year periods, we instead verified that both 2014 and 2016 were non-anomalous years in terms of total melt extent (Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>).</p>
      <p id="d2e671">For a model to successfully learn a specific task, we need not only a big quantity of data, but sufficient quantity of data that is actually relevant for the specific task we aim to solve. The spatial resolution of <inline-formula><mml:math id="M24" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.05</mml:mn><mml:mi mathvariant="italic">°</mml:mi><mml:mo>×</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula> results in 58 391 grid cells for the GrIS, and thus just as many samples for every single day in the data set. While selecting a long training period of 23 years ensures large interannual variability in the training set, not all of this data contains information that is relevant for solving our task, since surface melt is zero, or very close to zero, for large areas of the ice sheet and a substantial part of the year. Additionally, neighboring grid cells often exhibit very similar behavior, meaning that large spatial datasets can contain considerable redundancy with respect to the melt patterns relevant to model learning.</p>
      <p id="d2e691">To reduce the portion of low-relevance data samples, we apply strategic sub-sampling in time and space. The temporal sub-sampling reduces the number of no/low melt days by randomly sampling 100 d per year from a normal distribution centered around the 24 July (as peak of the melt season) and a standard deviation of 60 d. The spatial sub-sampling, on the other hand, favors grid cells in high melt areas over dry areas by selecting 5000 grid-cells according to predefined zone specific probabilities (see Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/>). By sub-sampling 5000 grid cells and 100 d per year from the training period, we create a training set of 12 million samples, reducing training costs significantly compared to the full data set of 511 million samples. While only about 6 % of the full training set show melt above 1 mm w.e. per day, the temporal and spatial sub-sampling increase this ratio to approximately 26 %. This sub-sampling stabilizes the training process; without it, the network tends to converge to the trivial local minimum of constantly predicting zero melt. We monitor training progress on the likewise sub-sampled validation set, but conduct model comparisons on the entire validation period data set to inform design choices, and report final performance on the entire test period data set.</p>
      <p id="d2e696">As inputs for our NN, we select atmospheric variables that contribute directly to the SEB, as defined in Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>), or indirectly: sensible heat flux SHF, latent heat flux LHF, downwelling longwave LW<sup>↓</sup>, downwelling shortwave radiation SW<sup>↓</sup>, rainfall <inline-formula><mml:math id="M27" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>, snowfall <inline-formula><mml:math id="M28" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, and near-surface (2 m) air temperature <inline-formula><mml:math id="M29" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>. We denote these input variables at daily resolution <inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>d</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. In addition, we include the cyclic features <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:mi>cos⁡</mml:mi><mml:mo>(</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>⋅</mml:mo><mml:mi mathvariant="italic">π</mml:mi></mml:mrow><mml:mn mathvariant="normal">365</mml:mn></mml:mfrac></mml:mstyle><mml:mi mathvariant="normal">DOY</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:mi>sin⁡</mml:mi><mml:mo>(</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>⋅</mml:mo><mml:mi mathvariant="italic">π</mml:mi></mml:mrow><mml:mn mathvariant="normal">365</mml:mn></mml:mfrac></mml:mstyle><mml:mi mathvariant="normal">DOY</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> (with DOY the day of year) to encode seasonality in our model. Lastly, long-term history is represented by aggregates of the input variables over the previous 10 year average, denoted by <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. The target variable is daily surface melt. Since absorbed shortwave radiation is highly sensitive to surface albedo <inline-formula><mml:math id="M34" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>, we also test model setups incorporating <inline-formula><mml:math id="M35" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>. As a last preprocessing step, all the data is standard scaled to zero mean and unit variance with respect to the sub-sampled training data.</p>
</sec>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Emulator design</title>
      <p id="d2e837">Since we aim to emulate surface melt modeled by the DMIHH firn model, we refer to that modeled surface melt as the “true” melt. The output of our ML model is called the “predicted” melt or “prediction”. With SM<inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> denoting the true melt, and <inline-formula><mml:math id="M37" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi mathvariant="normal">SM</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> the predicted melt for a specific day <inline-formula><mml:math id="M38" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, we formulate our problem as finding a function <inline-formula><mml:math id="M39" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula> such that

            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M40" display="block"><mml:mrow><mml:mi mathvariant="normal">SM</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>≈</mml:mo><mml:mover accent="true"><mml:mi mathvariant="normal">SM</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>f</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mi>d</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="script">T</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mi>l</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi mathvariant="normal">SM</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

          for all days <inline-formula><mml:math id="M41" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, where <inline-formula><mml:math id="M42" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>d</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="script">T</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> represents the daily input variables for <inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> d <inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:mi mathvariant="script">T</mml:mi><mml:mo>=</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>l</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> the long-term inputs leading up to day <inline-formula><mml:math id="M46" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi mathvariant="normal">SM</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> the melt prediction of the previous day.</p>
      <p id="d2e1074">Many approaches can yield an appropriate approximation function <inline-formula><mml:math id="M48" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>, and there is a priori no single best algorithm. Rather, the choice depends on factors such as the type of the problem, the type and complexity of the data, as well as its quality and quantity available for training <xref ref-type="bibr" rid="bib1.bibx18 bib1.bibx57" id="paren.29"/>. Given the highly nonlinear characteristics of the data set and the large amount of available data, we have chosen a NN for regressing the surface melt based on atmospheric variables <xref ref-type="bibr" rid="bib1.bibx2 bib1.bibx49" id="paren.30"/>.</p>
<sec id="Ch1.S2.SS2.SSS1">
  <label>2.2.1</label><title>Neural Network Fundamentals</title>
      <p id="d2e1097">A NN is composed of <inline-formula><mml:math id="M49" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> layers, which represents a concatenation of functions such that <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:mi>f</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>∘</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>∘</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>∘</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. Each <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> takes the previous layer's outputs, combines them linearly, applies a nonlinear activation function, and passes the results forward to the next layer. The coefficients of these linear combinations are the parameters that are optimized during training of the NN. The output layer <inline-formula><mml:math id="M52" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> yields the final prediction, while the previous layers are called the hidden layers. Each layer consists of neurons which represent the number of outputs of each function <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, and NN structures are often summarized by denoting the number of neurons of each (hidden) layer, e.g., a network with three layers containing 10 neurons each is written as “10-10-10” <xref ref-type="bibr" rid="bib1.bibx18" id="paren.31"/>. The hidden layers typically share the same non-linear activation function, e.g., Rectified Linear Unit (ReLU). We use LeakyReLU activation function in the hidden layers, a computationally efficient variant of ReLU that mitigates the drawbacks of classical ReLU <xref ref-type="bibr" rid="bib1.bibx10" id="paren.32"/>. The output layer often uses a different activation function than the hidden layers and is based on the task. Since our task is a regression task, we use no activation function. The whole network operates on scaled data; during inference the model predictions are reverse-scaled and clipped at zero to ensure non-negative melt predictions.</p>

      <fig id="F1"><label>Figure 1</label><caption><p id="d2e1183">Schematic of the modular neural network with short-term module, long-term module, and regression module to output <inline-formula><mml:math id="M54" display="inline"><mml:mover accent="true"><mml:mi mathvariant="normal">SM</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover></mml:math></inline-formula>, the additional output <inline-formula><mml:math id="M55" display="inline"><mml:mover accent="true"><mml:mi mathvariant="italic">α</mml:mi><mml:mo mathvariant="normal" stretchy="true">^</mml:mo></mml:mover></mml:math></inline-formula> and the autoregressive element (dashed elements).</p></caption>
            <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f01.png"/>

          </fig>

</sec>
<sec id="Ch1.S2.SS2.SSS2">
  <label>2.2.2</label><title>Network architecture</title>
      <p id="d2e1220">We designed the NN in three separate modules: two feature extraction modules and one regression module, as depicted in Fig. <xref ref-type="fig" rid="F1"/>. The first extraction module, the short-term module, takes the daily inputs <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>d</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of days <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:math></inline-formula> as inputs to determine the current forcing on the surface layer. Furthermore, it incorporates the seasonal encodings for day <inline-formula><mml:math id="M58" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, intended to approximate the firn cold content through seasonal indicators.</p>
      <p id="d2e1263">The second module, the long-term module, uses the long-term inputs <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. These long-term inputs are motivated by the spin-up procedure common to firn models and are meant to describe the prevailing firn characteristics at a site. We include these inputs alongside the seasonal encoding to provide location-specific information on firn cold content and bare-ice exposure proneness, which are factors that affect surface albedo and, consequently, surface melt.</p>
      <p id="d2e1277">The outputs of the two modules are then concatenated and fed into the final regression module which outputs the melt prediction <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi mathvariant="normal">SM</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for that day. While our proposed model configuration, Modular NN, consists of these three modules only, the network can be extended further by an autoregressive element, or by additional target variables. The autoregressive element (dashed arrow in Fig. <xref ref-type="fig" rid="F1"/>) feeds the melt of the previous day back into the daily module of the network to include the self–enhancing effect of surface melt. Alternatively, we use albedo as additional target variable since simultaneously learning albedo might improve melt predictions <xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx41" id="paren.33"/>. In this case, Eq. (<xref ref-type="disp-formula" rid="Ch1.E2"/>) holds true not only for melt SM but simultaneously for albedo. While the weights of the NN <inline-formula><mml:math id="M61" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula> are shared for predicting melt and albedo throughout most of the network, the regression module branches before its final layer, with separate last hidden layers for the two output neurons predicting melt and albedo (indicated by the dashed neuron connections for albedo output in Fig. <xref ref-type="fig" rid="F1"/>).</p>
      <p id="d2e1314">Given this network design described above, we develop our emulator in two stages. First, we optimize the network configuration while keeping the atmospheric input variables fixed. Then, using the best configuration, we study how using different subsets of atmospheric variables as inputs affect model performance by retraining on multiple variable subsets.</p>

<table-wrap id="T1"><label>Table 1</label><caption><p id="d2e1321">Overview of network configurations ordered by complexity, with their respective use of number of previous days <inline-formula><mml:math id="M62" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> in the short-term module, the long-term module, <inline-formula><mml:math id="M63" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> as additional target variable (Multitarget NN), and the autoregressive element (AutoregNN). Our main configuration Modular NN is indicated in italic. While  Modular NN does not use <inline-formula><mml:math id="M64" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>, we also trained a version of Modular NN with <inline-formula><mml:math id="M65" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> as input as an upper benchmark for performance.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M66" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">long-term</oasis:entry>
         <oasis:entry colname="col4">autoreg</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M67" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Regression NN</oasis:entry>
         <oasis:entry colname="col2">0</oasis:entry>
         <oasis:entry colname="col3">no</oasis:entry>
         <oasis:entry colname="col4">no</oasis:entry>
         <oasis:entry colname="col5">no</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Short-term NN</oasis:entry>
         <oasis:entry colname="col2">9</oasis:entry>
         <oasis:entry colname="col3">no</oasis:entry>
         <oasis:entry colname="col4">no</oasis:entry>
         <oasis:entry colname="col5">no</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><italic>Modular NN</italic></oasis:entry>
         <oasis:entry colname="col2">9</oasis:entry>
         <oasis:entry colname="col3">yes</oasis:entry>
         <oasis:entry colname="col4">no</oasis:entry>
         <oasis:entry colname="col5">no</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Modular NN w. <inline-formula><mml:math id="M68" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">9</oasis:entry>
         <oasis:entry colname="col3">yes</oasis:entry>
         <oasis:entry colname="col4">no</oasis:entry>
         <oasis:entry colname="col5">as input</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Multitarget NN</oasis:entry>
         <oasis:entry colname="col2">9</oasis:entry>
         <oasis:entry colname="col3">yes</oasis:entry>
         <oasis:entry colname="col4">no</oasis:entry>
         <oasis:entry colname="col5">as target</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Autoreg NN</oasis:entry>
         <oasis:entry colname="col2">9</oasis:entry>
         <oasis:entry colname="col3">yes</oasis:entry>
         <oasis:entry colname="col4">yes</oasis:entry>
         <oasis:entry colname="col5">no</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S2.SS2.SSS3">
  <label>2.2.3</label><title>Network configuration study</title>
      <p id="d2e1526">To determine the necessary yet sufficient network modules and elements, we developed our configuration iteratively, by sequentially tuning the network configurations listed in Table <xref ref-type="table" rid="T1"/>. During this process we keep the atmospheric variables used for inputs fixed: As daily inputs <inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>d</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> we use all the relevant atmospheric variables SHF, LHF, LW<sup>↓</sup>, SW<sup>↓</sup>, <inline-formula><mml:math id="M72" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M73" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M74" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>, together with the seasonal encoding. As proxy inputs for the long-term conditions <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> we use the 10 year averages of <inline-formula><mml:math id="M76" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M77" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>.</p>
      <p id="d2e1607">The simplest configuration, <italic>Regression NN</italic>, consists only of the short-term module, using climate conditions of only the current day <inline-formula><mml:math id="M78" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> as input (i.e. <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>), representing a pure regression model without any short-term or long-term historical information. This results in 9 input features (7 climate variables <inline-formula><mml:math id="M80" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 2 seasonal encoding variables), and we choose the hidden layers of the network to be 64-128-128-64-32-16-16, terminating in a single output neuron for melt prediction.</p>
      <p id="d2e1639">After having established this regression baseline, we investigate the impact of historical information by tuning <italic>Modular NN</italic> consisting of a short-term and a long-term module. The short-term module takes daily inputs from both the current and several previous days. The long-term module has two input neurons, and we choose two hidden layers of 32 neurons each. The hidden layers of the short-term module comprise 128-128-256, and the regression module combining the extracted feature from the two modules comprises 256-128-64-32-16-16. We tuned the number of the of preceding days <inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>∈</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> and found that <inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">9</mml:mn></mml:mrow></mml:math></inline-formula> resulted in the best performance on the validation set. To investigate the necessity of the long-term module, we fix the number of past days at <inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">9</mml:mn></mml:mrow></mml:math></inline-formula> and tune the network without the long-term module (<italic>Short-term NN</italic>). This results in a network with 72 input features and hidden layers 128-128-256-256-128-64-32-16-16.</p>
      <p id="d2e1692">To investigate the impact of the surface albedo <inline-formula><mml:math id="M84" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>, we train a configuration <italic>Modular NN w.</italic> <inline-formula><mml:math id="M85" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>, where <inline-formula><mml:math id="M86" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> is included in the daily inputs <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>d</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. While this model cannot be used for firn emulation since <inline-formula><mml:math id="M88" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> itself is an output of the firn model, this model serves as upper benchmark. In contrast, the model <italic>Multitarget NN</italic> does not use <inline-formula><mml:math id="M89" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> as additional input but as additional target, so that the model can learn <inline-formula><mml:math id="M90" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> and its impact on melt production. Multitarget NN results in two output neurons, one for melt and one for albedo. Next, we test whether incorporating an autoregressive step in the Modular NN improves model performance. <italic>Autoreg NN</italic> is thus also based on the configuration of Modular NN, but the melt of the previous day <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> is used as additional input. Autoreg NN is tuned using the true previous melt as input during training (called teacher-forcing), although we also experimented with autoregressive learning approaches using the previous prediction directly during training. More details on the tuning process can be found in Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/>.</p>
</sec>
<sec id="Ch1.S2.SS2.SSS4">
  <label>2.2.4</label><title>Systematic input selection study</title>
      <p id="d2e1781">We perform a systematic input selection study on our identified best-performing configuration Modular NN. In ML, such studies are called input ablation or sequential input selection: input variables are iteratively added or removed (ablated), and model performance is assessed after retraining. This approach assesses the necessity of each input variable for solving the task, contrasting with post-hoc feature importance analyses (e.g., SHAP <xref ref-type="bibr" rid="bib1.bibx29" id="altparen.34"/>) that evaluate how much a given feature contributes to an already trained model's prediction <xref ref-type="bibr" rid="bib1.bibx15 bib1.bibx32" id="paren.35"/>. The goal is to include all necessary variables while excluding redundant ones, as they inflate model complexity and can damage model performance, generalization, and interpretability. Finding the relevant features is not always straightforward and should not rely solely on feature correlation, but include domain knowledge <xref ref-type="bibr" rid="bib1.bibx48" id="paren.36"/>.</p>
      <p id="d2e1793">Since we know the direct drivers and mechanisms of melt production, i.e., the SEB terms according to Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>), we can systematically aggregate and remove inputs based on domain knowledge. Our input selection study is summarized in Table <xref ref-type="table" rid="T2"/>, and is twofold: The first part is to sum up the energy input terms LW<sup>↓</sup>, LHF, and SHF, since they all contribute equally to the SEB. The NN needs only to learn from total energy input rather than the individual components, improving model robustness while reducing complexity. In contrast, SW<sup>↓</sup> needs to be handled separately, because albedo strongly determines the energy finally absorbed by the surface. We define <italic>Modular NN EBMT</italic><sub><italic>d</italic></sub>, which uses daily inputs of the energy balance variables EB (with LW<sup>↓</sup>, LHF, and SHF summed up, and SW<sup>↓</sup> separate), the mass variables <inline-formula><mml:math id="M97" display="inline"><mml:mi>M</mml:mi></mml:math></inline-formula> (<inline-formula><mml:math id="M98" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M99" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>), and the near-surface temperature  <inline-formula><mml:math id="M100" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>. The long-term inputs remain the same as in the default Modular NN (10 year averages of <inline-formula><mml:math id="M101" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M102" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>).</p>
      <p id="d2e1891">The second part is a classical input selection study, sequentially adding and removing input variables. For this we start with a more extensive model, <italic>Modular NN EBMT</italic>, using all the atmospheric variables from the short-term module also in the long-term module. This allows us to determine the importance of long-term energy input rather than relying on temperature <inline-formula><mml:math id="M103" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> as a proxy. We then sequentially remove variables: removing <inline-formula><mml:math id="M104" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> yields <italic>Modular NN EBM</italic> (energy balance and mass terms only); removing <inline-formula><mml:math id="M105" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> yields <italic>Modular NN EBS</italic> (energy balance and snowfall); removing <inline-formula><mml:math id="M106" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> yields <italic>Modular NN EBR</italic> (energy balance and rainfall).</p>

<table-wrap id="T2" specific-use="star"><label>Table 2</label><caption><p id="d2e1939">Overview of the input selection study: Modular NN with the inputs used in the configuration study serves as the baseline. With the network configuration fixed, we vary the atmospheric variables used as inputs. With <inline-formula><mml:math id="M107" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula> we indicate variables used in the short-term module, and <inline-formula><mml:math id="M108" display="inline"><mml:mi>l</mml:mi></mml:math></inline-formula> variables that are used as 10 year averages in the long-term module. The starred versions <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:msup><mml:mi>d</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msup><mml:mi>l</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> indicate that the variables spanned by the line are summed up and provided as a single input to the model.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="9">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="center"/>
     <oasis:colspec colnum="3" colname="col3" align="center"/>
     <oasis:colspec colnum="4" colname="col4" align="center"/>
     <oasis:colspec colnum="5" colname="col5" align="center"/>
     <oasis:colspec colnum="6" colname="col6" align="center"/>
     <oasis:colspec colnum="7" colname="col7" align="center"/>
     <oasis:colspec colnum="8" colname="col8" align="center"/>
     <oasis:colspec colnum="9" colname="col9" align="center"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">seasonality</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M111" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M112" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M113" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">SW<sup>↓</sup></oasis:entry>
         <oasis:entry colname="col7">LW<sup>↓</sup></oasis:entry>
         <oasis:entry colname="col8">LHF</oasis:entry>
         <oasis:entry colname="col9">SHF</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Modular NN</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M116" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M119" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M120" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M121" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M122" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9"><inline-formula><mml:math id="M123" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Modular NN EBMT<sub><italic>d</italic></sub></oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M125" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M126" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M128" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M129" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">–</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:msup><mml:mi>d</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">–</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Modular NN EBMT</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M131" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M134" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">–</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:msup><mml:mi>d</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msup><mml:mi>l</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">–</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><italic>Modular NN EBM</italic></oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M138" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M139" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">–</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:msup><mml:mi>d</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M143" display="inline"><mml:mrow><mml:msup><mml:mi>l</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">–</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Modular NN EBS</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M144" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">–</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:msup><mml:mi>d</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:msup><mml:mi>l</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">–</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Modular NN EBR</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M149" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">–</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:msup><mml:mi>d</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:msup><mml:mi>l</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">–</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Modular NN EBM<sub>noDOY</sub></oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">–</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:msup><mml:mi>d</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M159" display="inline"><mml:mrow><mml:msup><mml:mi>l</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">–</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Emulator training</title>
      <p id="d2e2661">Each configuration is trained multiple times for 300 epochs, respectively, tuning the learning rate using the Python library Optuna <xref ref-type="bibr" rid="bib1.bibx1" id="paren.37"/> for Bayesian optimization. We use Adam optimizer <xref ref-type="bibr" rid="bib1.bibx25" id="paren.38"/>, a batch size of 256, a learning rate decay factor of 0.9 every 50 epochs, and gradient clipping to a norm of 1 to stabilize training. For a more detailed description and the results of the tuning procedure, see Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/>.</p>
      <p id="d2e2672">We performed the training on an NVIDIA GRID A100D-40C GPU with 16 vCPUs (128 GB RAM). Individual training runs required approximately 25–45 minutes, depending on the complexity of the configuration. While our network configurations are regarded small in a deep learning context, the large volume of data is the critical factor in training time, and the data loading process remains the bottleneck in our pipeline despite heavy optimizations: For efficient data loading, we saved the training data in zarr chunks by date, with each chunk containing the samples of all 5000 sub-sampled grid cells to minimize number of chunks that need to be opened and loaded. Thus, during training we read batches of 256 chunks, which leads to an effective batch size of <inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:mn mathvariant="normal">256</mml:mn><mml:mo>⋅</mml:mo><mml:mn mathvariant="normal">5000</mml:mn><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1280</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">000</mml:mn></mml:mrow></mml:math></inline-formula> samples. The batch size of 256 is thereby limited by the GPU memory, and the data loading is distributed across multiple CPUs in parallel to achieve high data throughput. After the model is trained, generating one year of melt predictions from preprocessed input takes approximately one minute on CPU. Although preprocessing (computing 10-year averages, data cleaning, scaling, and reformatting to zarr files) requires up to two hours, the total computational cost remains far lower than for the physical firn model, which needs 2.5 h on 16 CPUs per simulated year.</p>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Evaluation strategy</title>
      <p id="d2e2702">For final evaluation of our models, we use the test set (year 2016), which has not been used during training or to guide any model development decisions. For each configuration, we select the model from the tuning procedure that performed best on the validation data set, and report its root mean square error (RMSE), mean absolute error (MAE), mean bias error (MBE), and the coefficient of determination (<inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>). Alongside the performance of our different model configurations, we provide a reference benchmark consisting of a running climatology of surface melt. This climatology is computed from the training set and smoothed with a 15 d moving average. Comparing the emulator skill to this benchmark, we can differentiate whether the emulator simply learned the climatology or whether it also learned to predict the variation and anomalies from climatology, which is the actual purpose of the emulator. We therefore also report the <inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> on the anomalies with respect to this climatology (<inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>). Error tables are further complemented by visual evaluations of true versus predicted melt values, and maps of true and predicted melt alongside with their residuals, defined as predicted minus true melt.</p>
      <p id="d2e2740">For the final model, we evaluate spatial and temporal biases through seasonal melt maps and basin-wise timeseries analysis. SMB outputs are often used in aggregated form per basin, which experience very different climatic regimes in terms of snowfall, long-term net snow accumulation, rainfall, and combination of surface melt drivers <xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx27 bib1.bibx47 bib1.bibx51 bib1.bibx55" id="paren.39"/>. To assess whether our location-agnostic model performs consistently across these different conditions or exhibits basin-specific biases, we evaluate model performance separately for each basin, using the basins definitions shown in Fig. <xref ref-type="fig" rid="F2"/> (from <xref ref-type="bibr" rid="bib1.bibx14" id="altparen.40"/>). Because internal variability produces large inter-annual differences, an evaluation based on the test year alone can be misleading. Therefore, we perform the basin-wise assessment not just for the test year 2016 but across the entire period 1990–2016.</p>

      <fig id="F2"><label>Figure 2</label><caption><p id="d2e2753">Partitioning of Greenland into six basins N (north), NE (north-east), CE (central-east), SE (south-east), SW (south-west), NW (north-west) used in the evaluation.</p></caption>
          <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f02.png"/>

        </fig>

<table-wrap id="T3" specific-use="star"><label>Table 3</label><caption><p id="d2e2766">Performance for surface melt prediction of final models from the configuration study evaluated on the test set. RMSE, MAE, and MBE in mm w.e. per day; <inline-formula><mml:math id="M164" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> the standard coefficient of determination, and <inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> is relative to anomalies from climatology. The five models are ordered by increasing model complexity. Autoreg NN (teacher) and Modular NN w. albedo use firn model output as input variables and are only listed for comparison.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="6">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">MAE</oasis:entry>
         <oasis:entry colname="col4">MBE</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Climatology</oasis:entry>
         <oasis:entry colname="col2">2.30</oasis:entry>
         <oasis:entry colname="col3">0.56</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M168" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.17</oasis:entry>
         <oasis:entry colname="col5">0.78</oasis:entry>
         <oasis:entry colname="col6">–</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Regression NN</oasis:entry>
         <oasis:entry colname="col2">1.60</oasis:entry>
         <oasis:entry colname="col3">0.40</oasis:entry>
         <oasis:entry colname="col4">0.05</oasis:entry>
         <oasis:entry colname="col5">0.89</oasis:entry>
         <oasis:entry colname="col6">0.51</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Short-term NN</oasis:entry>
         <oasis:entry colname="col2">1.23</oasis:entry>
         <oasis:entry colname="col3">0.26</oasis:entry>
         <oasis:entry colname="col4">0.01</oasis:entry>
         <oasis:entry colname="col5">0.94</oasis:entry>
         <oasis:entry colname="col6">0.71</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><italic>Modular NN</italic></oasis:entry>
         <oasis:entry colname="col2">0.90</oasis:entry>
         <oasis:entry colname="col3">0.18</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M169" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.01</oasis:entry>
         <oasis:entry colname="col5">0.97</oasis:entry>
         <oasis:entry colname="col6">0.85</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Multitarget NN</oasis:entry>
         <oasis:entry colname="col2">0.90</oasis:entry>
         <oasis:entry colname="col3">0.17</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M170" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.01</oasis:entry>
         <oasis:entry colname="col5">0.97</oasis:entry>
         <oasis:entry colname="col6">0.85</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Autoreg NN</oasis:entry>
         <oasis:entry colname="col2">0.90</oasis:entry>
         <oasis:entry colname="col3">0.15</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M171" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.00</oasis:entry>
         <oasis:entry colname="col5">0.97</oasis:entry>
         <oasis:entry colname="col6">0.85</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Autoreg NN (teacher)</oasis:entry>
         <oasis:entry colname="col2">0.40</oasis:entry>
         <oasis:entry colname="col3">0.08</oasis:entry>
         <oasis:entry colname="col4">0.01</oasis:entry>
         <oasis:entry colname="col5">0.99</oasis:entry>
         <oasis:entry colname="col6">0.97</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Modular NN w. <inline-formula><mml:math id="M172" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.24</oasis:entry>
         <oasis:entry colname="col3">0.05</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M173" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.00</oasis:entry>
         <oasis:entry colname="col5">1.00</oasis:entry>
         <oasis:entry colname="col6">0.99</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results and discussion</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Evaluation of the configuration study</title>
      <p id="d2e3088">We start by analyzing overall mean performance over the entire ice sheet. The performance of the best models from the tuning process (Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/>: Table <xref ref-type="table" rid="TC1"/>; Fig. <xref ref-type="fig" rid="FC1"/>a) on the test set are summarized in Table <xref ref-type="table" rid="T3"/>. All five configurations outperform the climatology benchmark, and performance increases with model complexity from a MAE of 0.40 mm w.e. per day for Regression NN, to 0.26 mm w.e. per day for Short-term NN, 0.18 mm w.e. per day for Modular NN, 0.17 mm w.e. per day for Multitarget NN, and 0.15 mm w.e. per day for Autoreg NN. The other metrics show improvement from Regression NN to Short-term and Modular NN, but then plateau. Thus, information about the past few days and long-term history is essential. Autoreg NN shows that the autoregressive element does not significantly improve performance further in inference mode, where previous predictions are recursively used as input. In contrast, evaluating Autoreg NN in teacher-forced mode demonstrates that information contained in the previous day’s melt is indeed predictive. This discrepancy indicates that, while the autoregressive signal is informative, its benefit is limited in practice by error propagation during recursive rollout.</p>
      <p id="d2e3099">The additional experiment using the Modular NN with <inline-formula><mml:math id="M174" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> highlights the impact of albedo on accurately estimating surface melt, as providing <inline-formula><mml:math id="M175" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> enables the model to achieve  excellent performance with almost perfect <inline-formula><mml:math id="M176" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> scores of 0.99. However, since albedo is an output of the firn model, this model configuration cannot be used as an emulator. Moreover, since albedo is strongly connected with surface temperature (given sufficient snow depth), it implicitly encodes the presence of melt, though it does not uniquely determine its magnitude. As a result, it remains unclear whether the improvement in melt prediction arises from the model accurately capturing absorbed shortwave radiation or simply from albedo serving as a direct indicator of melt on non–bare ice grid cells. In contrast, augmenting the model to predict albedo as an additional target (Multitarget NN) does not improve melt estimates.</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e3129">2D hexagonal binning plots of true versus predicted surface melt of the test set of the different models. Note that <bold>(d)</bold> uses the firn model output albedo as input variable and is thus only listed for comparison. The logarithmic color bar is valid for bins containing up to <inline-formula><mml:math id="M177" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> data points; bins containing more than <inline-formula><mml:math id="M178" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> points are indicated in black for better visibility.</p></caption>
          <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f03.jpg"/>

        </fig>

<sec id="Ch1.S3.SS1.SSS1">
  <label>3.1.1</label><title>Point-wise evaluation</title>
      <p id="d2e3171">Across all models, the RMSE substantially exceeds MAE, indicating considerable under- and overestimation of melt across melt events of all magnitudes, as shown by the dark blue colored hexagonal bins in Fig. <xref ref-type="fig" rid="F3"/>. The superior performance of Modular NN, Autoreg NN, and Multitarget NN compared to Regression NN and Short-term NN stems primarily from improved predictions for a majority of data points for melt events up to 50 mm w.e. per day (narrower dark red band in Fig. <xref ref-type="fig" rid="F3"/>c, e, f compared to a and b). For Modular NN, 64 % of the RMSE is attributable to absolute residuals up to 5 mm w.e. per day, a further 34 % to absolute residuals between 5 and 15 mm w.e. per day, and only 2 % of the RMSE to absolute residuals exceeding 15 mm w.e. per day.</p>
      <p id="d2e3178">Modular NN (Fig. <xref ref-type="fig" rid="F3"/>c), Modular NN with <inline-formula><mml:math id="M179" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="F3"/>d), and Autoreg NN (Fig. <xref ref-type="fig" rid="F3"/>f) show some notable overestimation pattern of melt events above 50 mm w.e. per day. This systematic deviation originates from an unusually early melt event in April in the SW basin, where severe overestimation occurs on only one single day (Fig. <xref ref-type="fig" rid="FD1"/>). While the model successfully predicts the unusually early surface melt for most days during this event, one particular day exhibits sensible heat flux values up to 460 <inline-formula><mml:math id="M180" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">W</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. While such extreme sensible heat flux values also appear in the training set, the combination of an anomalously large heat flux with unusually early melt creates conditions that are effectively out-of-sample.</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e3216">Surface melt for 21 July 2016 with melt extent (ME) in km<sup>2</sup>, median and IQR of melt in mm w.e. per day. <bold>(a)</bold> true melt, <bold>(b)</bold> and <bold>(c)</bold> predicted melt (left panels) and residuals (right panels) of Modular NN and Autoreg NN (in inference mode). <bold>(d)</bold> and <bold>(e)</bold> show the predictions and residuals of Autoreg NN in teacher-forced mode, and of Modular NN with albedo as additional input; both these models cannot be used to produce predictions from climate forcing only, as they use firn model outputs as inputs. <bold>(f)</bold> shows the true albedo, <bold>(g)</bold> the albedo prediction and its residual, and <bold>(h)</bold> the melt prediction and its residual of Multitarget NN.</p></caption>
            <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f04.jpg"/>

          </fig>

</sec>
<sec id="Ch1.S3.SS1.SSS2">
  <label>3.1.2</label><title>Qualitative assessment</title>
      <p id="d2e3267">But what causes these over- and underestimations? For this, we look at a typical day from the peak melt season in the test set in more detail. Figure <xref ref-type="fig" rid="F4"/> shows true and predicted melt, alongside residuals, for different models for a day in July. The melt maps are given together with the daily melt extent (ME) in km<sup>2</sup>, whereby we only consider melt  <inline-formula><mml:math id="M183" display="inline"><mml:mo>&gt;</mml:mo></mml:math></inline-formula> 1 mm w.e. per day, and its median and IQR in mm w.e. per day.</p>
      <p id="d2e3288">Modular NN predicts spatially over-smoothed melt fields, evident in both the predicted field itself and the residual map in Fig. <xref ref-type="fig" rid="F4"/>b, which also leads to a larger ME. Autoreg NN in teacher-forced mode (Fig. <xref ref-type="fig" rid="F4"/>d) shows substantial improvement in the spatial structure, with more pronounced contours in the predictions. However, the true previous melt used to make teacher-forced predictions is not available when applying the emulator to new climate data, where the model is run in inference mode, using its own prediction of the previous day. While the predicted melt field in inference mode (Fig. <xref ref-type="fig" rid="F4"/>c) still exhibits sharper contours compared to Modular NN, the residual plot shows that there is uncertainty on where exactly these sharp contours should be.</p>
      <p id="d2e3297">Modular NN with <inline-formula><mml:math id="M184" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> as an additional input shows that knowledge of surface albedo improves the spatial patterns significantly (Fig. <xref ref-type="fig" rid="F4"/>e). This suggests that the autoregressive element's importance lies mainly in its role as a proxy for surface albedo, which in turn is also an indicator of melt presence. However, as noted above, this configuration cannot be used for predictive applications. The alternative model Multitarget NN, which uses albedo as an auxiliary target instead does not lead to noteworthy improvement of melt predictions (Fig. <xref ref-type="fig" rid="F4"/>h). Unfortunately, the predicted albedo fields are also over-smoothed (Fig. <xref ref-type="fig" rid="F4"/>f, g), with the residual map closely mirroring the spatial patterns of the melt residuals (Fig. <xref ref-type="fig" rid="F4"/>h).</p>
      <p id="d2e3315">Daily maps for additional days throughout the year are shown in Appendix <xref ref-type="sec" rid="App1.Ch1.S5"/>, with the maps for a day in June (Fig. <xref ref-type="fig" rid="FE3"/>) and August (Fig. <xref ref-type="fig" rid="FE4"/>) showing similar spatial structures as those for July in Fig. <xref ref-type="fig" rid="F4"/>. These results reveal the fundamental challenge: the simultaneous over- and underestimations arise from the model's inability to accurately reconstruct the sharp spatial structures of the surface state, which are present at the daily scale. Without explicit knowledge of surface albedo, the model produces smoothed fields that systematically overestimate melt in some locations while at the same time underestimating it in others, creating the characteristic spatial error pattern observed in the residuals.</p>
</sec>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Evaluation of the systematic input selection study</title>
      <p id="d2e3335">Since Modular NN, Autoreg NN, and Multitarget NN show very similar performance on the test set, we perform the systematic input selection study based on Modular NN as it is the least complex model among these three. Table <xref ref-type="table" rid="T4"/> lists the performance of the Modular NN using different input subsets defined in Table <xref ref-type="table" rid="T2"/> and Sect. <xref ref-type="sec" rid="Ch1.S2.SS2.SSS4"/>. The metrics are given for the respectively best model from the tuning process (Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/>: Table <xref ref-type="table" rid="TC2"/>; Fig. <xref ref-type="fig" rid="FC1"/>b) on the test set.</p>
      <p id="d2e3351">Modular NN EBMT<sub><italic>d</italic></sub> does not show any significant improvement on the test scores compared to Modular NN with the default input set. However, Fig. <xref ref-type="fig" rid="F5"/> shows a better correspondence between true and predicted melt for the sparse high melt values compared to Modular NN (Fig. <xref ref-type="fig" rid="F3"/>c). Especially the previously discussed melt overestimation due to the unusually high sensible heat flux (Fig. <xref ref-type="fig" rid="FD1"/>) does not happen anymore when using the energy inputs summed up, proving the stabilizing effect of using the energy terms summed up instead of separately.</p>
      <p id="d2e3369">Modular NN EBMT shows that including all atmospheric variables not just in the short-term module but their 10 year averages also in the long-term module improves model performance from RMSE of <inline-formula><mml:math id="M186" display="inline"><mml:mrow class="unit"><mml:mn mathvariant="normal">0.90</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">mm</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">w</mml:mi><mml:mo>.</mml:mo><mml:mi mathvariant="normal">e</mml:mi><mml:mo>.</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">per</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">day</mml:mi></mml:mrow></mml:math></inline-formula> to <inline-formula><mml:math id="M187" display="inline"><mml:mrow class="unit"><mml:mn mathvariant="normal">0.85</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">mm</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">w</mml:mi><mml:mo>.</mml:mo><mml:mi mathvariant="normal">e</mml:mi><mml:mo>.</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">per</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">day</mml:mi></mml:mrow></mml:math></inline-formula>, compared to using only snowfall and temperature as long-term conditions proxies. Removing <inline-formula><mml:math id="M188" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> from the inputs (Modular NN EBM) improves performance on the test set only slightly. Having rainfall as inputs improves model performance slightly, since the metrics worsened for Modular NN EBS. Snowfall on the other hand is a very critical input, as RMSE increases by <inline-formula><mml:math id="M189" display="inline"><mml:mrow class="unit"><mml:mn mathvariant="normal">0.37</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">mm</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">w</mml:mi><mml:mo>.</mml:mo><mml:mi mathvariant="normal">e</mml:mi><mml:mo>.</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">per</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">day</mml:mi></mml:mrow></mml:math></inline-formula> if the model is trained without snowfall (Modular NN EBR). While the relation between true and predicted values for all subsets look very similar, Modular NN EBR shows increased amount of small deviations up to 50 mm w.e. per day (wider spread of red band in Fig. <xref ref-type="fig" rid="FF1"/>e compared to a–d, f).</p>
      <p id="d2e3462">Thus, best model is achieved with EB terms (with LW<sup>↓</sup>, LHF, and SHF summed up, and SW<sup>↓</sup> separate) and mass terms (especially snowfall) on a daily resolution for the current and the past 9 d, and additionally as 10 year averages. Furthermore, seasonal encoding is critical. Including air temperature <inline-formula><mml:math id="M192" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>, which often serves as melt proxy, cannot improve model performance any further. In contrast, including <inline-formula><mml:math id="M193" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> increases model size by 11 additional input features, and it can impair model interpretability since <inline-formula><mml:math id="M194" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> is not a causal input to melt production, but a confounding variable for turbulent heat fluxes and melt. Moreover, while <inline-formula><mml:math id="M195" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> does impact the test scores only slightly, the generalizability to future climate projections could be affected by including <inline-formula><mml:math id="M196" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> as an input variable. Melt models based on <inline-formula><mml:math id="M197" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> as proxies often under- or overestimate melt under a warming climate, as they do not consistently include melt-albedo feedback, changed cloud formation and thus change in radiative inputs <xref ref-type="bibr" rid="bib1.bibx17 bib1.bibx5" id="paren.41"/>. While the applicability of ML models to future projections must always be evaluated separately, air temperature could rather be a confounding variable decreasing model performance and transferability to different climates, than being a useful predictor.</p>

<table-wrap id="T4"><label>Table 4</label><caption><p id="d2e3533">Performance of final models on the test set for the default Modular NN and its variations using different input variable subsets consisting of energy balance terms (EB), mass terms <inline-formula><mml:math id="M198" display="inline"><mml:mi>M</mml:mi></mml:math></inline-formula> (rainfall (<inline-formula><mml:math id="M199" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>) and snowfall (<inline-formula><mml:math id="M200" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>)), temperature <inline-formula><mml:math id="M201" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>, and seasonality encoding (with noDOY indicating seasonality encoding was removed as input). RMSE, MAE, and MBE in mm w.e. per day; <inline-formula><mml:math id="M202" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> the standard coefficient of determination, and <inline-formula><mml:math id="M203" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> is relative to anomalies from climatology.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="6">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">MAE</oasis:entry>
         <oasis:entry colname="col4">MBE</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M205" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Climatology</oasis:entry>
         <oasis:entry colname="col2">2.30</oasis:entry>
         <oasis:entry colname="col3">0.56</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M206" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.17</oasis:entry>
         <oasis:entry colname="col5">0.78</oasis:entry>
         <oasis:entry colname="col6">–</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Modular NN</oasis:entry>
         <oasis:entry colname="col2">0.90</oasis:entry>
         <oasis:entry colname="col3">0.18</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M207" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.01</oasis:entry>
         <oasis:entry colname="col5">0.97</oasis:entry>
         <oasis:entry colname="col6">0.85</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> EBMT<sub><italic>d</italic></sub></oasis:entry>
         <oasis:entry colname="col2">0.90</oasis:entry>
         <oasis:entry colname="col3">0.17</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M209" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.00</oasis:entry>
         <oasis:entry colname="col5">0.97</oasis:entry>
         <oasis:entry colname="col6">0.85</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> EBMT</oasis:entry>
         <oasis:entry colname="col2">0.85</oasis:entry>
         <oasis:entry colname="col3">0.16</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M210" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.01</oasis:entry>
         <oasis:entry colname="col5">0.97</oasis:entry>
         <oasis:entry colname="col6">0.86</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> <italic>EBM</italic></oasis:entry>
         <oasis:entry colname="col2">0.84</oasis:entry>
         <oasis:entry colname="col3">0.16</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M211" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.00</oasis:entry>
         <oasis:entry colname="col5">0.97</oasis:entry>
         <oasis:entry colname="col6">0.87</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> EBS</oasis:entry>
         <oasis:entry colname="col2">0.87</oasis:entry>
         <oasis:entry colname="col3">0.16</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M212" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.01</oasis:entry>
         <oasis:entry colname="col5">0.97</oasis:entry>
         <oasis:entry colname="col6">0.86</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> EBR</oasis:entry>
         <oasis:entry colname="col2">1.21</oasis:entry>
         <oasis:entry colname="col3">0.25</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M213" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.01</oasis:entry>
         <oasis:entry colname="col5">0.94</oasis:entry>
         <oasis:entry colname="col6">0.72</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"> EBM<sub>noDOY</sub></oasis:entry>
         <oasis:entry colname="col2">0.93</oasis:entry>
         <oasis:entry colname="col3">0.18</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M215" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.00</oasis:entry>
         <oasis:entry colname="col5">0.96</oasis:entry>
         <oasis:entry colname="col6">0.84</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <fig id="F5"><label>Figure 5</label><caption><p id="d2e3899">2D hexagonal binning plots of true versus predicted surface melt of the test set of Modular NN EBMT<sub><italic>d</italic></sub>. The logarithmic color bar is valid for bins containing up to <inline-formula><mml:math id="M217" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> data points; bins containing more than <inline-formula><mml:math id="M218" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> points are indicated in black for better visibility.</p></caption>
          <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f05.jpg"/>

        </fig>

<sec id="Ch1.S3.SS2.SSS1">
  <label>3.2.1</label><title>Spatial-temporal residual distribution</title>
      <p id="d2e3946">In the remainder of this section we investigate the performance of Modular NN EBM spatially and temporally. This analysis includes predictions from both the test year 2016 to show the error patterns of a single independent year and the entire period 1990–2016 to investigate persistent systematic biases of our model.</p>
      <p id="d2e3949">Figure <xref ref-type="fig" rid="F6"/> shows the seasonally aggregated melt for the test set, starting from December 2015 to November 2016. The overestimated melt extent and lower median of melt predictions compared to true melt in the daily maps for Modular NN (Fig. <xref ref-type="fig" rid="F4"/>, Appendix <xref ref-type="sec" rid="App1.Ch1.S5"/>) is also visible in the seasonal aggregates of Modular NN EBM across all seasons. The south-west coast shows some underestimation in surface melt in spring (MAM, Fig. <xref ref-type="fig" rid="F6"/>b), which intensifies in summer (JJA, Fig. <xref ref-type="fig" rid="F6"/>c). In contrast the north and north-west coastal region shows overestimation in summer. However, Fig. <xref ref-type="fig" rid="F7"/> shows that this melt overestimation in JJA is a behavior more specific to the test year, rather than a characteristic of the model Modular NN EBM. The residual plot for JJA averaged across 1990–2016 shows systematic underestimation of melt within the whole ablation zone, and some overestimation at the transition between ablation and percolation zone especially in the north and north-west (compare GrIS zones Fig. <xref ref-type="fig" rid="FB1"/>). Together with the superior performance of Modular NN with <inline-formula><mml:math id="M219" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> this indicated that the underestimation stems from the lowered albedo for shallow snow depths and bare ice. The long-term variables using 10 year averages in combination with the daily variables for in total 10 days are not enough to derive snow depth and thus decreased albedo due to bare ice exposure. Including variables at a monthly or quarterly scale leading up to the prediction day might help in approximating snow depth and associated albedo.</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e3976">True melt alongside melt predictions and residuals of Modular EBM seasonally aggregated for the test set, but with December in <bold>(a)</bold> taken from 2015. <bold>(a)</bold> December, January, February; <bold>(b)</bold> March, April, May; <bold>(c)</bold> June, July, August; <bold>(d)</bold> September, October, November. Note that color scales are adjusted individually for each subplot.</p></caption>
            <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f06.jpg"/>

          </fig>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e4003">True melt alongside melt predictions and residuals of Modular EBM seasonally aggregated per year, and averaged across 1990–2016. Note that color scales are adjusted individually for each subplot.</p></caption>
            <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f07.jpg"/>

          </fig>

      <p id="d2e4012">In the remainder of this section we investigate the performance of Modular NN EBM at the basin level. To enable performance comparison across the six basins, we present the mean annual melt and performance scores for each basin and the whole ice sheet in Table <xref ref-type="table" rid="T5"/>. The melt amount and the scores are given for the test year 2016, together with the mean <inline-formula><mml:math id="M220" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> standard deviation of the single years for the entire period 1990–2016. Figure <xref ref-type="fig" rid="F8"/> shows basin-wise integrated true and predicted melt for the test year alongside their climatologies, together with daily over- and estimation throughout the annual cycle for the test year (middle rows) and averaged over all years 1990–2016 (bottom rows).</p>
      <p id="d2e4026">Basins SE and SW show the highest melt with respect to their basin area, and also the highest year-to-year fluctuations in melt amount, followed by basin N (Table <xref ref-type="table" rid="T5"/>). Correlation <inline-formula><mml:math id="M221" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> between true and predicted melt is high across all basins, with the northern basin having slightly lower <inline-formula><mml:math id="M222" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> (0.95–0.97) than the southern basins (0.98). The <inline-formula><mml:math id="M223" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> computed on anomalies from the climatology is highest for the southern basins SE and SW (0.93 and 0.92), with low standard deviation of 0.02. In contrast, the northern basins N and NE show <inline-formula><mml:math id="M224" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> of 0.78 and 0.75 respectively, paired with higher standard deviations of 0.07 and 0.11 respectively. This indicates that the emulator captures variability with respect to the climatology particularly well in the SW and SE basins, which have stronger anomalies that the model can learn, while the other basins show lower anomaly skill.</p>
      <p id="d2e4079">In accordance with that, the RMSE mean and standard deviations are highest for basins N and NE compared to the other basins. Also, as with the whole ice sheet, MAE is significantly lower than RMSE for all basins, indicating that a few large residuals persist across all regions. The MAE-to-RMSE ratio indicates that basins NE and NW are affected most by a small number of high residuals, while basins SW and SE are the least dominated by such outliers, and more by small and medium errors.</p>
      <p id="d2e4082">Basin SE is the only one with a positive bias on average (0.01), while the other basins show zero or slightly negative MBE of <inline-formula><mml:math id="M225" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.02 at most on average across the years. However, the respective standard deviations across the years are bigger than the absolute means of the biases, i.e. no basin has a systematic bias across the years. Basins N and SW show the largest fluctuations in bias with a standard deviation of 0.05, while they also show high average melt together with high melt variability relative to mean melt amount (27 %), and thus these basins are particularly variable and hard to learn.</p>
      <p id="d2e4092">Figure <xref ref-type="fig" rid="F8"/> shows that not only is there no systematic bias across the years, but also that over- and underestimating melt is largely synchronous, thus there are also no pronounced systematic temporal biases. However, basins N and NW show a tendency toward early-season underestimation and late-season overestimation, while basin NE shows a dominance of underestimation throughout the entire year.</p>
      <p id="d2e4098">2016 is a relatively high melt year for each basin, but within one standard deviation from the respective means. Basins N and NW show anomalous positive MBE of 0.06 and 0.05 respectively, while basin SW shows strong negative bias of <inline-formula><mml:math id="M226" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.06, such that the biases cancel each other out across the whole GrIS. Figure <xref ref-type="fig" rid="F8"/> shows enhanced melt at peak melt season for basins N and NE, causing more melt overestimation than normal, and higher <inline-formula><mml:math id="M227" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>. Basin NW also shows high melt at peak season, but then a sudden decrease end of July, and thus the melt amount deviates significantly from the climatology which leads to a <inline-formula><mml:math id="M228" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> 1.5 standard deviation lower than for the average year. Also SW and to a lesser extent CE and SE have lower <inline-formula><mml:math id="M229" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> than the average year as they show a lot of temporal fluctuation in the melt (top panels in Fig. <xref ref-type="fig" rid="F8"/> SW and SE), also starting with some early melt. For SW and SE the RMSE and MAE are also more than a standard deviation above the average.</p>
      <p id="d2e4152">These deviations in 2016 highlight the model's challenges in capturing anomalous melt patterns that deviate substantially from the climatological mean, particularly in basins with pronounced seasonal deviations such as NW's abrupt late-summer decline and early melt onset in basins SW and SE. However, while individual basin biases in 2016 are notable (positive for N and NW, negative for SW), these deviations appear to be year-specific rather than systematic. Averaging across the entire 1990–2016 period reveals that the model exhibits no persistent systematic biases, with year-to-year fluctuations in error patterns largely balanced over time. Although based on the performance including the training set, this temporal stability suggests that the emulator's performance is robust across the multi-decadal timescale.</p>

<table-wrap id="T5" specific-use="star"><label>Table 5</label><caption><p id="d2e4158">Performance of Modular NN EBM for predicting surface melt per basin and over the entire GrIS for the test year 2016, together with mean <inline-formula><mml:math id="M230" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> standard deviation across the full period 1990–2016. Spatial mean annual melt, RMSE, MAE, and MBE in mm w.e. per day.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">mean melt</oasis:entry>
         <oasis:entry colname="col3">RMSE</oasis:entry>
         <oasis:entry colname="col4">MAE</oasis:entry>
         <oasis:entry colname="col5">MBE</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M231" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M232" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">N</oasis:entry>
         <oasis:entry colname="col2">409</oasis:entry>
         <oasis:entry colname="col3">0.92</oasis:entry>
         <oasis:entry colname="col4">0.17</oasis:entry>
         <oasis:entry colname="col5">0.06</oasis:entry>
         <oasis:entry colname="col6">0.97</oasis:entry>
         <oasis:entry colname="col7">0.81</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(<inline-formula><mml:math id="M233" display="inline"><mml:mrow><mml:mn mathvariant="normal">350</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">95</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col3">(<inline-formula><mml:math id="M234" display="inline"><mml:mrow><mml:mn mathvariant="normal">1.02</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.15</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col4">(<inline-formula><mml:math id="M235" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.17</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.04</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col5">(<inline-formula><mml:math id="M236" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col6">(<inline-formula><mml:math id="M237" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.95</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col7">(<inline-formula><mml:math id="M238" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.78</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.07</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">NE</oasis:entry>
         <oasis:entry colname="col2">300</oasis:entry>
         <oasis:entry colname="col3">0.73</oasis:entry>
         <oasis:entry colname="col4">0.12</oasis:entry>
         <oasis:entry colname="col5">-0.00</oasis:entry>
         <oasis:entry colname="col6">0.97</oasis:entry>
         <oasis:entry colname="col7">0.84</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(<inline-formula><mml:math id="M239" display="inline"><mml:mrow><mml:mn mathvariant="normal">235</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">59</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col3">(<inline-formula><mml:math id="M240" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.83</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.15</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col4">(<inline-formula><mml:math id="M241" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.12</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.03</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col5">(<inline-formula><mml:math id="M242" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.04</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col6">(<inline-formula><mml:math id="M243" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.95</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col7">(<inline-formula><mml:math id="M244" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.75</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.11</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">NW</oasis:entry>
         <oasis:entry colname="col2">310</oasis:entry>
         <oasis:entry colname="col3">0.81</oasis:entry>
         <oasis:entry colname="col4">0.13</oasis:entry>
         <oasis:entry colname="col5">0.05</oasis:entry>
         <oasis:entry colname="col6">0.96</oasis:entry>
         <oasis:entry colname="col7">0.82</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(<inline-formula><mml:math id="M245" display="inline"><mml:mrow><mml:mn mathvariant="normal">284</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">84</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col3">(<inline-formula><mml:math id="M246" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.72</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.12</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col4">(<inline-formula><mml:math id="M247" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.11</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.03</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col5">(<inline-formula><mml:math id="M248" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.03</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col6">(<inline-formula><mml:math id="M249" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.97</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col7">(<inline-formula><mml:math id="M250" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.88</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.04</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CE</oasis:entry>
         <oasis:entry colname="col2">312</oasis:entry>
         <oasis:entry colname="col3">0.73</oasis:entry>
         <oasis:entry colname="col4">0.13</oasis:entry>
         <oasis:entry colname="col5">-0.01</oasis:entry>
         <oasis:entry colname="col6">0.96</oasis:entry>
         <oasis:entry colname="col7">0.83</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(<inline-formula><mml:math id="M251" display="inline"><mml:mrow><mml:mn mathvariant="normal">273</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">60</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col3">(<inline-formula><mml:math id="M252" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.72</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.09</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col4">(<inline-formula><mml:math id="M253" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.12</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col5">(<inline-formula><mml:math id="M254" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.00</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.03</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col6">(<inline-formula><mml:math id="M255" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.96</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col7">(<inline-formula><mml:math id="M256" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.85</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.04</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SW</oasis:entry>
         <oasis:entry colname="col2">525</oasis:entry>
         <oasis:entry colname="col3">0.96</oasis:entry>
         <oasis:entry colname="col4">0.21</oasis:entry>
         <oasis:entry colname="col5">-0.06</oasis:entry>
         <oasis:entry colname="col6">0.97</oasis:entry>
         <oasis:entry colname="col7">0.89</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(<inline-formula><mml:math id="M257" display="inline"><mml:mrow><mml:mn mathvariant="normal">442</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">122</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col3">(<inline-formula><mml:math id="M258" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.78</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.13</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col4">(<inline-formula><mml:math id="M259" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.15</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.03</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col5">(<inline-formula><mml:math id="M260" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col6">(<inline-formula><mml:math id="M261" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.98</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col7">(<inline-formula><mml:math id="M262" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.92</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SE</oasis:entry>
         <oasis:entry colname="col2">608</oasis:entry>
         <oasis:entry colname="col3">0.88</oasis:entry>
         <oasis:entry colname="col4">0.20</oasis:entry>
         <oasis:entry colname="col5">-0.02</oasis:entry>
         <oasis:entry colname="col6">0.97</oasis:entry>
         <oasis:entry colname="col7">0.92</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(<inline-formula><mml:math id="M263" display="inline"><mml:mrow><mml:mn mathvariant="normal">548</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">111</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col3">(<inline-formula><mml:math id="M264" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.76</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.08</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col4">(<inline-formula><mml:math id="M265" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.17</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col5">(<inline-formula><mml:math id="M266" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.01</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col6">(<inline-formula><mml:math id="M267" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.98</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col7">(<inline-formula><mml:math id="M268" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.93</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">GrIS</oasis:entry>
         <oasis:entry colname="col2">403</oasis:entry>
         <oasis:entry colname="col3">0.84</oasis:entry>
         <oasis:entry colname="col4">0.16</oasis:entry>
         <oasis:entry colname="col5">-0.00</oasis:entry>
         <oasis:entry colname="col6">0.97</oasis:entry>
         <oasis:entry colname="col7">0.87</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(<inline-formula><mml:math id="M269" display="inline"><mml:mrow><mml:mn mathvariant="normal">345</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">78</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col3">(<inline-formula><mml:math id="M270" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.81</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.09</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col4">(<inline-formula><mml:math id="M271" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.14</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col5">(<inline-formula><mml:math id="M272" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.03</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col6">(<inline-formula><mml:math id="M273" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.97</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col7">(<inline-formula><mml:math id="M274" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.87</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.03</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <fig id="F8" specific-use="star"><label>Figure 8</label><caption><p id="d2e5101">Temporal distribution of over- and underestimations of Modular NN EBM per basin. The respective upper subplots show the basin-wise total true and predicted surface melt for the test year together with the true and predicted melt climatologies (1990–2013). The middle subplots show the total amount of overestimated (red) and underestimated (blue) melt per basins for the test year. The lower subplots show the average annual overestimated and underestimated melt for the whole time period 1990–2016.</p></caption>
            <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f08.png"/>

          </fig>

</sec>
</sec>
</sec>
<sec id="Ch1.S4" sec-type="conclusions">
  <label>4</label><title>Conclusions</title>
      <p id="d2e5120">We have developed a machine learning emulator that successfully predicts daily surface melt on the Greenland ice sheet from atmospheric variables alone. By training a neural network on 24 years of output from the polar regional climate model HIRHAM5 and its firn model DMIHH, we demonstrate that surface melt can be accurately emulated with a mean absolute error of 0.16 mm w.e. per day (Table <xref ref-type="table" rid="T4"/>), significantly outperforming climatological benchmarks. Basin-level evaluation demonstrates that our location-agnostic approach generalizes well across the diverse climatic regimes of Greenland. The emulator maintains high correlation across all six major basins with minimal systematic bias (Table <xref ref-type="table" rid="T5"/>).</p>
      <p id="d2e5127">Our iterative model development reveals several key insights about the role of temporal information. Including atmospheric conditions from the previous nine days substantially improves performance over using only current-day conditions, demonstrating that short-term history matters. Furthermore, long-term climate memory in the form of decadal averages of temperature and snowfall improve model performance by providing crucial information about location-specific firn characteristics that affect the surface energy balance. Thus, the model profits from short- and long-term memory from these past conditions. Beyond the climate forcings that directly impact SEB, i.e., turbulent heat fluxes and radiation, also snowfall and seasonality encoding are crucial input parameters. Rainfall does improve model performance but only slightly, while air temperature does not contribute to any additional performance increase, and should therefore be excluded from emulation approaches particularly when the model is combined with explainability algorithms, where redundant variables can obscure interpretability <xref ref-type="bibr" rid="bib1.bibx33 bib1.bibx24" id="paren.42"/>.</p>
      <p id="d2e5133">The predicted melt fields tend to be spatially over-smoothed compared to the firn model output, lacking sharp transitions between regions of different melt extent which are a consequence of the albedo scheme using lower albedo for bare ice and small snow depths. Neither an autoregressive approach (Autoreg NN) nor learning albedo as an auxiliary target (Multitarget NN) did improve this over-smoothing. Therefore, capturing the sharp spatial gradients emerging from different surface conditions remains a fundamental challenge for data-driven approaches using atmospheric input data only. Future work should explore whether incorporating more versatile historical information, such as accumulation and energy input from preceding months, can help differentiate high versus low snow regimes. The neural network architecture could be extended with a module operating on monthly timescales, or more sophisticated approaches for timeseries modeling, such as LSTMs <xref ref-type="bibr" rid="bib1.bibx21" id="paren.43"/> or transformer-based architectures <xref ref-type="bibr" rid="bib1.bibx53" id="paren.44"/>, could be investigated.</p>
      <p id="d2e5142">Additionally, the domain of applicability must be determined and extended. The emulator is developed on HIRHAM5 reanalysis data and trained to emulate DMIHH firn model behavior. Extrapolation beyond the training distribution yields unreliable results in data-driven approaches, and it is unclear to what extent different simulations deviate in their data distribution and how sensitive the emulator is to these changes. Applying the emulator to climate data under different forcings, from different time periods, or from entirely different polar RCMs thus likely requires retraining. Furthermore, in the future the emulator can be extended to predict additional firn model outputs, such as runoff, creating a more comprehensive tool for Greenland SMB estimation.</p>
      <p id="d2e5147">In conclusion, this work demonstrates that machine learning can successfully emulate firn model behavior with spatially and temporally consistent accuracy and computational efficiency, while also revealing fundamental challenges in capturing sharp spatial patterns driven by surface characteristics. This emulator, when coupled with downscaling emulators that bridge the gap between global climate models and regional applications, enables the large ensemble projections needed to quantify uncertainty both within individual RCMs and across the divergent projections from different polar RCMs. Furthermore, such a firn emulator can be used as a surrogate model for SMB processes in Earth system models, enabling interactive ice sheet-climate coupling at scales previously computationally infeasible.</p>
</sec>

      
      </body>
    <back><app-group>

<app id="App1.Ch1.S1">
  <label>Appendix A</label><title>Characteristics of train, validation, and test years</title>
      <p id="d2e5163">Table <xref ref-type="table" rid="TA1"/> shows aggregated melt characteristics per year compared to the climatology over 1990-2013. The total melt of the validation year (<inline-formula><mml:math id="M275" display="inline"><mml:mrow class="unit"><mml:mn mathvariant="normal">650</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">Gt</mml:mi></mml:mrow></mml:math></inline-formula>) and the test year (<inline-formula><mml:math id="M276" display="inline"><mml:mrow class="unit"><mml:mn mathvariant="normal">726</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">Gt</mml:mi></mml:mrow></mml:math></inline-formula>) lie well within the historical range of the training climatology (294-<inline-formula><mml:math id="M277" display="inline"><mml:mrow class="unit"><mml:mn mathvariant="normal">1059</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">Gt</mml:mi></mml:mrow></mml:math></inline-formula>) and are comparable to several years in the record (e.g., 1995, 2002, 2011, and 1998, 2003, 2005, 2007, 2008 respectively), deviating from the climatological mean by less than 0.75 standard deviations. The MBE exhibits a clear shift from predominantly negative values (until 2002) to positive values thereafter. Despite this overall change, the deviation metrics w.r.t. the climatological melt for 2014 and 2016 align well with other years in the dataset, suggesting these are climatologically typical years rather than anomalous ones. This assessment is supported by the Arctic reports <xref ref-type="bibr" rid="bib1.bibx23 bib1.bibx39" id="text.45"/>, which document that 2014 and 2016 reflect the increasing melting trend without reaching the extremes of previous record years.</p>

<table-wrap id="TA1"><label>Table A1</label><caption><p id="d2e5208">Total melt (in Gt per year); RMSE, MAE and MBE (in mm w.e. per day) relative to the climatology over 1990–2013, and the coefficient of determination between the specific year and the climatology (<inline-formula><mml:math id="M278" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>).</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="6">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">melt (Gt)</oasis:entry>
         <oasis:entry colname="col3">RMSE</oasis:entry>
         <oasis:entry colname="col4">MAE</oasis:entry>
         <oasis:entry colname="col5">MBE</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M279" display="inline"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mtext>anom</mml:mtext><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">1990</oasis:entry>
         <oasis:entry colname="col2">576</oasis:entry>
         <oasis:entry colname="col3">2.08</oasis:entry>
         <oasis:entry colname="col4">0.50</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M280" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.06</oasis:entry>
         <oasis:entry colname="col6">0.77</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1991</oasis:entry>
         <oasis:entry colname="col2">547</oasis:entry>
         <oasis:entry colname="col3">2.11</oasis:entry>
         <oasis:entry colname="col4">0.52</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M281" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.10</oasis:entry>
         <oasis:entry colname="col6">0.74</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1992</oasis:entry>
         <oasis:entry colname="col2">294</oasis:entry>
         <oasis:entry colname="col3">2.48</oasis:entry>
         <oasis:entry colname="col4">0.60</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M282" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.48</oasis:entry>
         <oasis:entry colname="col6">0.28</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1993</oasis:entry>
         <oasis:entry colname="col2">570</oasis:entry>
         <oasis:entry colname="col3">1.93</oasis:entry>
         <oasis:entry colname="col4">0.48</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M283" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.07</oasis:entry>
         <oasis:entry colname="col6">0.78</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1994</oasis:entry>
         <oasis:entry colname="col2">476</oasis:entry>
         <oasis:entry colname="col3">1.96</oasis:entry>
         <oasis:entry colname="col4">0.49</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M284" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.21</oasis:entry>
         <oasis:entry colname="col6">0.74</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1995</oasis:entry>
         <oasis:entry colname="col2">626</oasis:entry>
         <oasis:entry colname="col3">2.50</oasis:entry>
         <oasis:entry colname="col4">0.58</oasis:entry>
         <oasis:entry colname="col5">0.02</oasis:entry>
         <oasis:entry colname="col6">0.71</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1996</oasis:entry>
         <oasis:entry colname="col2">456</oasis:entry>
         <oasis:entry colname="col3">2.15</oasis:entry>
         <oasis:entry colname="col4">0.53</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M285" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.24</oasis:entry>
         <oasis:entry colname="col6">0.65</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1997</oasis:entry>
         <oasis:entry colname="col2">519</oasis:entry>
         <oasis:entry colname="col3">2.25</oasis:entry>
         <oasis:entry colname="col4">0.54</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M286" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.15</oasis:entry>
         <oasis:entry colname="col6">0.69</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1998</oasis:entry>
         <oasis:entry colname="col2">709</oasis:entry>
         <oasis:entry colname="col3">2.32</oasis:entry>
         <oasis:entry colname="col4">0.55</oasis:entry>
         <oasis:entry colname="col5">0.14</oasis:entry>
         <oasis:entry colname="col6">0.76</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1999</oasis:entry>
         <oasis:entry colname="col2">546</oasis:entry>
         <oasis:entry colname="col3">2.22</oasis:entry>
         <oasis:entry colname="col4">0.54</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M287" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.11</oasis:entry>
         <oasis:entry colname="col6">0.72</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2000</oasis:entry>
         <oasis:entry colname="col2">548</oasis:entry>
         <oasis:entry colname="col3">2.46</oasis:entry>
         <oasis:entry colname="col4">0.57</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M288" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.10</oasis:entry>
         <oasis:entry colname="col6">0.68</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2001</oasis:entry>
         <oasis:entry colname="col2">502</oasis:entry>
         <oasis:entry colname="col3">2.05</oasis:entry>
         <oasis:entry colname="col4">0.50</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M289" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.17</oasis:entry>
         <oasis:entry colname="col6">0.73</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2002</oasis:entry>
         <oasis:entry colname="col2">686</oasis:entry>
         <oasis:entry colname="col3">2.47</oasis:entry>
         <oasis:entry colname="col4">0.58</oasis:entry>
         <oasis:entry colname="col5">0.11</oasis:entry>
         <oasis:entry colname="col6">0.73</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2003</oasis:entry>
         <oasis:entry colname="col2">732</oasis:entry>
         <oasis:entry colname="col3">2.62</oasis:entry>
         <oasis:entry colname="col4">0.59</oasis:entry>
         <oasis:entry colname="col5">0.18</oasis:entry>
         <oasis:entry colname="col6">0.74</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2004</oasis:entry>
         <oasis:entry colname="col2">596</oasis:entry>
         <oasis:entry colname="col3">2.05</oasis:entry>
         <oasis:entry colname="col4">0.51</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M290" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.03</oasis:entry>
         <oasis:entry colname="col6">0.77</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2005</oasis:entry>
         <oasis:entry colname="col2">723</oasis:entry>
         <oasis:entry colname="col3">2.50</oasis:entry>
         <oasis:entry colname="col4">0.60</oasis:entry>
         <oasis:entry colname="col5">0.17</oasis:entry>
         <oasis:entry colname="col6">0.74</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2006</oasis:entry>
         <oasis:entry colname="col2">592</oasis:entry>
         <oasis:entry colname="col3">2.49</oasis:entry>
         <oasis:entry colname="col4">0.59</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M291" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.04</oasis:entry>
         <oasis:entry colname="col6">0.68</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2007</oasis:entry>
         <oasis:entry colname="col2">740</oasis:entry>
         <oasis:entry colname="col3">2.43</oasis:entry>
         <oasis:entry colname="col4">0.58</oasis:entry>
         <oasis:entry colname="col5">0.19</oasis:entry>
         <oasis:entry colname="col6">0.76</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2008</oasis:entry>
         <oasis:entry colname="col2">723</oasis:entry>
         <oasis:entry colname="col3">2.55</oasis:entry>
         <oasis:entry colname="col4">0.59</oasis:entry>
         <oasis:entry colname="col5">0.16</oasis:entry>
         <oasis:entry colname="col6">0.75</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2009</oasis:entry>
         <oasis:entry colname="col2">571</oasis:entry>
         <oasis:entry colname="col3">2.12</oasis:entry>
         <oasis:entry colname="col4">0.53</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M292" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.07</oasis:entry>
         <oasis:entry colname="col6">0.76</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2010</oasis:entry>
         <oasis:entry colname="col2">797</oasis:entry>
         <oasis:entry colname="col3">2.63</oasis:entry>
         <oasis:entry colname="col4">0.61</oasis:entry>
         <oasis:entry colname="col5">0.28</oasis:entry>
         <oasis:entry colname="col6">0.74</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2011</oasis:entry>
         <oasis:entry colname="col2">672</oasis:entry>
         <oasis:entry colname="col3">2.16</oasis:entry>
         <oasis:entry colname="col4">0.51</oasis:entry>
         <oasis:entry colname="col5">0.09</oasis:entry>
         <oasis:entry colname="col6">0.79</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2012</oasis:entry>
         <oasis:entry colname="col2">1059</oasis:entry>
         <oasis:entry colname="col3">3.61</oasis:entry>
         <oasis:entry colname="col4">0.93</oasis:entry>
         <oasis:entry colname="col5">0.68</oasis:entry>
         <oasis:entry colname="col6">0.65</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">2013</oasis:entry>
         <oasis:entry colname="col2">510</oasis:entry>
         <oasis:entry colname="col3">1.94</oasis:entry>
         <oasis:entry colname="col4">0.49</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M293" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.16</oasis:entry>
         <oasis:entry colname="col6">0.77</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2014 (val)</oasis:entry>
         <oasis:entry colname="col2">650</oasis:entry>
         <oasis:entry colname="col3">2.22</oasis:entry>
         <oasis:entry colname="col4">0.53</oasis:entry>
         <oasis:entry colname="col5">0.05</oasis:entry>
         <oasis:entry colname="col6">0.77</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2015 (gap)</oasis:entry>
         <oasis:entry colname="col2">611</oasis:entry>
         <oasis:entry colname="col3">2.28</oasis:entry>
         <oasis:entry colname="col4">0.57</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M294" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.01</oasis:entry>
         <oasis:entry colname="col6">0.73</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2016 (test)</oasis:entry>
         <oasis:entry colname="col2">726</oasis:entry>
         <oasis:entry colname="col3">2.30</oasis:entry>
         <oasis:entry colname="col4">0.56</oasis:entry>
         <oasis:entry colname="col5">0.17</oasis:entry>
         <oasis:entry colname="col6">0.78</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>


</app>

<app id="App1.Ch1.S2">
  <label>Appendix B</label><title>Spatial Sub-sampling</title>
      <p id="d2e5978">The spatial sub-sampling of 5000 grid-cells is performed according to predefined zone specific probabilities. The probabilities are chosen to be of 65 % for the ablation zone, 30 % for the percolation zone, and 5 % for the dry-snow zone. Here, the zones are defined based on the SMB from the 10 year time range 1990–1999, with the ablation zone having negative SMB, the dry-snow zone showing melt below 100 mm w.e. per year for each year, and the percolation zone covering the remaining grid cells of positive SMB but non-negligible melt (Fig. <xref ref-type="fig" rid="FB1"/>). Alternative sampling strategies were not formally evaluated, as the chosen approach was sufficient for the modeling objectives and computational constraints of this study. Model results indicate that the main source of predictive error is related to unknown surface conditions, especially albedo, rather than the quantity of training data. This suggests that increasing the amount of training data would not necessarily reduce prediction errors, and that effective model training could potentially be achieved with fewer data points.</p>

      <fig id="FB1"><label>Figure B1</label><caption><p id="d2e5985">Ablation (blue), percolation (brown), and dry-snow (green) zone used for spatial sub-sampling.</p></caption>
        <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f09.png"/>

      </fig>


</app>

<app id="App1.Ch1.S3">
  <label>Appendix C</label><title>Network tuning</title>
      <p id="d2e6004">Hyperparameters in ML algorithms are settings that control the algorithm’s behavior, but are not adapted by the learning algorithm itself. This includes the choices of the architecture itself, its capacity, the activation function, regularization techniques, initialization, optimization algorithms and their specific setting, and more. Since it is unfeasible to tune all these hyperparameters, well-informed choices must be made to prioritize the most impactful parameters. The Modular NN architecture was defined based on physical principles, and we explore different architectural configurations to identify the most suitable design. The network capacity (number of layers and neurons per layer) was chosen to be sufficiently large, as evidenced by overfitting observed during training. To prevent using an overfitted model, we select the model weights that yield the lowest validation loss, which effectively corresponds to regularization via early stopping. The batch size is fixed at the maximum value permitted by available computing resources. Given these design choices, we primarily focus on tuning the learning rate for each configuration, as it is the most critical factor for training convergence and optimization performance <xref ref-type="bibr" rid="bib1.bibx18" id="paren.46"/>.</p>
      <p id="d2e6010">Table <xref ref-type="table" rid="TC1"/> presents the overview of the NN configurations, the tuned hyperparameter ranges, alongside RMSE, MAE, and MBE of the validation set of the best performing model for each configuration across all tuning trials. We define the best performing model by the sum of the relative MAE and MBE w.r.t. the seasonal anomaly errors. Fig. <xref ref-type="fig" rid="FC1"/>a shows the performance of all the trials, with the more complex models having consistently better performance than the less complex models. The results of Multitarget NN are not plotted, since they strongly coincide with the performance of Modular NN.</p>
      <p id="d2e6017">For each of the five configurations, we tuned the learning rate. For Modular NN, we additionally tuned the number <inline-formula><mml:math id="M295" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> of preceding days used as input, to determine the optimal number of input days, which was then fixed when tuning the subsequent configurations. As Multitarget NN has a composite loss function consisting of both melt and albedo terms, the weighting between those two components is crucial. Therefore, while tuning the learning rate we simultaneously varied the weighting factors of melt MSE and albedo MSE for calculating the total loss, testing the following melt:albedo weight–ratios: <inline-formula><mml:math id="M296" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M297" display="inline"><mml:mrow><mml:mn mathvariant="normal">7</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M298" display="inline"><mml:mrow><mml:mn mathvariant="normal">9</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M299" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">7</mml:mn></mml:mrow></mml:math></inline-formula>. Further, we took a second attempt using trainable weights as proposed in <xref ref-type="bibr" rid="bib1.bibx7" id="text.47"/>.</p>
      <p id="d2e6079">We trained the configurations one after another, using insights from each result to guide the development of the next configuration. While performance generally improves with more preceding days, gains become marginal beyond 8 d, with 9 d achieving the best score. Therefore, the subsequently trained configurations Short-term NN, Autoreg NN, and Multitarget NN use 10 input days (i.e., <inline-formula><mml:math id="M300" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">9</mml:mn></mml:mrow></mml:math></inline-formula> preceding days), with the learning rate being tuned for 33 trials. We also narrowed the range of possible learning rate values when progressing through the different configurations as we gained insight into which ranges make sense.</p>
      <p id="d2e6096">When training the autoregressive NN under teacher-forced mode, we use the true melt with random noise, i.e., SM<inline-formula><mml:math id="M301" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi mathvariant="italic">ε</mml:mi></mml:mrow></mml:math></inline-formula> with <inline-formula><mml:math id="M302" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>∈</mml:mo><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> to get more robustness and not rely too much on the previous melt input. The results of Autoreg NN (teacher) show that the model benefits significantly from knowing the previous day's surface melt. However, this advantage diminishes when the model is evaluated in inference mode, i.e., when using the previous prediction instead of the true melt. We alternatively tested different strategies of training autoregressively with using the previous predicted melt <inline-formula><mml:math id="M303" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi mathvariant="normal">SM</mml:mi><mml:mo stretchy="true" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> during training, using different ratios of teacher-forced versus true melt, and different lengths of rollout windows. Although RMSE slightly improved, MAE and MBE did not decrease, and the same error patterns observed for the modular NN (which are discussed in the qualitative assessment in Sect. <xref ref-type="sec" rid="Ch1.S3.SS1.SSS2"/>) remained. Furthermore, the autoregressive training requires much higher computing resources, since a prediction for a specific day requires to make the prediction for the previous day(s), which also required a decreased batch size during training.</p>
      <p id="d2e6164">As baseline for information gain from the variable albedo, we retrain Modular NN including albedo in the set of daily input variables. This leads to a RMSE of 0.22, MAE of 0.05, and MBE of <inline-formula><mml:math id="M304" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.25</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> mm w.e. per day on the validation set.</p>
      <p id="d2e6185">Modular NN is then trained using different variable subsets as inputs as defined in Table <xref ref-type="table" rid="T2"/>. Each of the variations have been trained 33 times with optimizing the learning rate within the range <inline-formula><mml:math id="M305" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. For each subset, the performance scores of the best performing model across all trials (in terms of the sum of the relative MAE and MBE w.r.t. the seasonal anomaly) are listed in Table <xref ref-type="table" rid="TC2"/>. Figure <xref ref-type="fig" rid="FC1"/>b shows the performance of all the trials.</p><table-wrap id="TC1"><label>Table C1</label><caption><p id="d2e6226">Network Tuning: Overview of the five network configurations with the tuning parameter ranges, and their validation scores in mm w.e. per day. The performance of Autoreg NN is stated in teacher-forced and in inference mode.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Optuna study</oasis:entry>
         <oasis:entry colname="col3">RMSE</oasis:entry>
         <oasis:entry colname="col4">MAE</oasis:entry>
         <oasis:entry colname="col5">MBE</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Climatology</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">2.22</oasis:entry>
         <oasis:entry colname="col4">0.53</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M306" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">5.29</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Regression NN</oasis:entry>
         <oasis:entry colname="col2">10 trials: <inline-formula><mml:math id="M307" display="inline"><mml:mrow><mml:mi>l</mml:mi><mml:mi>r</mml:mi><mml:mo>∈</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">1.48</oasis:entry>
         <oasis:entry colname="col4">0.34</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M308" display="inline"><mml:mrow><mml:mn mathvariant="normal">5.27</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Modular NN</oasis:entry>
         <oasis:entry colname="col2">50 trials: <inline-formula><mml:math id="M309" display="inline"><mml:mrow><mml:mi>l</mml:mi><mml:mi>r</mml:mi><mml:mo>∈</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>,</oasis:entry>
         <oasis:entry colname="col3">0.86</oasis:entry>
         <oasis:entry colname="col4">0.16</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M310" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.15</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M311" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:math></inline-formula> days <inline-formula><mml:math id="M312" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>∈</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Short-term NN</oasis:entry>
         <oasis:entry colname="col2">33 trials: <inline-formula><mml:math id="M313" display="inline"><mml:mrow><mml:mi>l</mml:mi><mml:mi>r</mml:mi><mml:mo>∈</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">1.13</oasis:entry>
         <oasis:entry colname="col4">0.22</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M314" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.08</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Autoreg NN (teacher)</oasis:entry>
         <oasis:entry colname="col2">33 trials: <inline-formula><mml:math id="M315" display="inline"><mml:mrow><mml:mi>l</mml:mi><mml:mi>r</mml:mi><mml:mo>∈</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.34</oasis:entry>
         <oasis:entry colname="col4">0.06</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M316" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.06</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">inference</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">0.86</oasis:entry>
         <oasis:entry colname="col4">0.14</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M317" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.01</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Multitarget NN</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M318" display="inline"><mml:mrow><mml:mn mathvariant="normal">33</mml:mn><mml:mo>+</mml:mo><mml:mn mathvariant="normal">33</mml:mn></mml:mrow></mml:math></inline-formula> trials: <inline-formula><mml:math id="M319" display="inline"><mml:mrow><mml:mi>l</mml:mi><mml:mi>r</mml:mi><mml:mo>∈</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>,</oasis:entry>
         <oasis:entry colname="col3">0.84</oasis:entry>
         <oasis:entry colname="col4">0.15</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M320" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.11</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">loss weights (manual <inline-formula><mml:math id="M321" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> trainable)</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<table-wrap id="TC2"><label>Table C2</label><caption><p id="d2e6756">Input selection study: Overview of Modular NN using different input variable subsets, with the validation scores in mm w.e. per day.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">MAE</oasis:entry>
         <oasis:entry colname="col4">MBE</oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Climatology</oasis:entry>
         <oasis:entry colname="col2">2.22</oasis:entry>
         <oasis:entry colname="col3">0.53</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M322" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">5.29</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Modular NN</oasis:entry>
         <oasis:entry colname="col2">0.86</oasis:entry>
         <oasis:entry colname="col3">0.16</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M323" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.15</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">EBMT<sub><italic>d</italic></sub></oasis:entry>
         <oasis:entry colname="col2">0.83</oasis:entry>
         <oasis:entry colname="col3">0.15</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M325" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.06</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">EBMT</oasis:entry>
         <oasis:entry colname="col2">0.81</oasis:entry>
         <oasis:entry colname="col3">0.14</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M326" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.10</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">EBM</oasis:entry>
         <oasis:entry colname="col2">0.80</oasis:entry>
         <oasis:entry colname="col3">0.14</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M327" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.09</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">EBS</oasis:entry>
         <oasis:entry colname="col2">0.81</oasis:entry>
         <oasis:entry colname="col3">0.14</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M328" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.00</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">EBR</oasis:entry>
         <oasis:entry colname="col2">1.12</oasis:entry>
         <oasis:entry colname="col3">0.22</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M329" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.19</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">EBM<sub>noDOY</sub></oasis:entry>
         <oasis:entry colname="col2">0.89</oasis:entry>
         <oasis:entry colname="col3">0.16</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M331" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.05</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<fig id="FC1"><label>Figure C1</label><caption><p id="d2e7088">MAE and MBE on the validation set of the tuned models. The best performing model for each configuration is indicated by the black outlined shapes. <bold>(a)</bold> Models from the network configuration study: The seasonal signal (black circle) is the melt climatology over the years 1990–2013, smoothed with a 15 days window. The tuned networks Regression NN, Modular NN, Short-term NN, and Autoreg NN (in inference mode) are indicated by circles; Autreg NN in teacher-forced mode cannot be interpreted as applicable model, since it relies on the true melt and can only be used in its inference state, and is indicated with diamond symbol. Modular NN is indicated with the “plus” symbols, and shows the results for the input ablation study. The vertical gray dotted lines indicate the MAE range in panel <bold>(b)</bold>. <bold>(b)</bold> Models from the input selection study: The Modular NN from the network configuration study is again depicted in orange color.</p></caption>
        
        <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f10.png"/>

      </fig>


</app>

<app id="App1.Ch1.S4">
  <label>Appendix D</label><title>Assessment of predicted outliers</title>

      <fig id="FD1"><label>Figure D1</label><caption><p id="d2e7120">True melt, and predicted melt by Modular NN with associated residual for three consecutive days in April 2016, with <bold>(b)</bold> causing the positive residual outliers shown in Fig. <xref ref-type="fig" rid="F3"/>c. The blue and red triangles on the residual color bar indicate the highest negative and positive residual of that day, respectively.</p></caption>
        
        <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f11.jpg"/>

      </fig>


</app>

<app id="App1.Ch1.S5">
  <label>Appendix E</label><title>Daily Maps</title>

      <fig id="FE1"><label>Figure E1</label><caption><p id="d2e7148">Surface melt for 21 April 2016 with melt extent (ME) in km<sup>2</sup>, median and IQR of melt in mm w.e. per day. <bold>(a)</bold> True melt, <bold>(b)</bold>–<bold>(e)</bold>, <bold>(h)</bold> predicted melt (left panels) and residuals (right panels); <bold>(f)</bold> true albedo and <bold>(g)</bold> predicted albedo and residual as additional outputs of Multitarget NN.</p></caption>
        
        <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f12.jpg"/>

      </fig>

<fig id="FE2"><label>Figure E2</label><caption><p id="d2e7190">Surface melt for 21 May 2016 with melt extent (ME) in km<sup>2</sup>, median and IQR of melt in mm w.e. per day. <bold>(a)</bold> True melt, <bold>(b)</bold>–<bold>(e)</bold>, <bold>(h)</bold> predicted melt (left panels) and residuals (right panels); <bold>(f)</bold> true albedo and <bold>(g)</bold> predicted albedo and residual as additional outputs of Multitarget NN.</p></caption>
        
        <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f13.jpg"/>

      </fig>

<fig id="FE3"><label>Figure E3</label><caption><p id="d2e7233">Surface melt for 21 June 2016 with melt extent (ME) in km<sup>2</sup>, median and IQR of melt in mm w.e. per day. <bold>(a)</bold> True melt, <bold>(b)</bold>–<bold>(e)</bold>, <bold>(h)</bold> predicted melt (left panels) and residuals (right panels); <bold>(f)</bold> true albedo and <bold>(g)</bold> predicted albedo and residual as additional outputs of Multitarget NN.</p></caption>
        
        <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f14.jpg"/>

      </fig>

<fig id="FE4"><label>Figure E4</label><caption><p id="d2e7275">Surface melt for 21 August 2016 with melt extent (ME) in km<sup>2</sup>, median and IQR of melt in mm w.e. per day. <bold>(a)</bold> True melt, <bold>(b)</bold>–<bold>(e)</bold>, <bold>(h)</bold> predicted melt (left panels) and residuals (right panels); <bold>(f)</bold> true albedo and <bold>(g)</bold> predicted albedo and residual as additional outputs of Multitarget NN.</p></caption>
        
        <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f15.jpg"/>

      </fig>

<fig id="FE5"><label>Figure E5</label><caption><p id="d2e7317">Surface melt for 21 September 2016 with melt extent (ME) in km<sup>2</sup>, median and IQR of melt in mm w.e. per day. <bold>(a)</bold> True melt, <bold>(b)</bold>–<bold>(e)</bold>, <bold>(h)</bold> predicted melt (left panels) and residuals (right panels); <bold>(f)</bold> true albedo and <bold>(g)</bold> predicted albedo and residual as additional outputs of Multitarget NN.</p></caption>
        
        <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f16.png"/>

      </fig>


</app>

<app id="App1.Ch1.S6">
  <label>Appendix F</label><title>2D hexagonal binning plots of Modular NN input selection study</title>

      <fig id="FF1"><label>Figure F1</label><caption><p id="d2e7369">2D hexagonal binning plots of true versus predicted surface melt of the test set of Modular NN using different input subsets. The logarithmic color bar is valid for bins containing up to <inline-formula><mml:math id="M337" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> data points; bins containing more than <inline-formula><mml:math id="M338" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">5</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> points are indicated in black for better visibility.</p></caption>
        
        <graphic xlink:href="https://tc.copernicus.org/articles/20/3313/2026/tc-20-3313-2026-f17.jpg"/>

      </fig>

</app>
  </app-group><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e7406">Code is published under the MIT License and can be accessed via the GitHub repository <uri>https://github.com/eschlager/MeltEmulation/tree/revision</uri> (last access: 28 May 2026) or the Zenodo archive <ext-link xlink:href="https://doi.org/10.5281/zenodo.20271069" ext-link-type="DOI">10.5281/zenodo.20271069</ext-link> <xref ref-type="bibr" rid="bib1.bibx42" id="paren.48"/>. The data produced in this study is available at <ext-link xlink:href="https://doi.org/10.5281/zenodo.19627367" ext-link-type="DOI">10.5281/zenodo.19627367</ext-link> <xref ref-type="bibr" rid="bib1.bibx43" id="paren.49"/>. The HIRHAM5 simulation data is freely available upon request <xref ref-type="bibr" rid="bib1.bibx26" id="paren.50"/>.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e7431">ES, PL, and RM conceptualized the study and designed the methodology. ES did the code implementation, computational experiments, and visualization. ES performed the analysis and validation with consultation from SS. ES prepared the original draft of the manuscript, with help from SS, PL, and RM. RM provided the financial support and access to computing resources.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e7439">At least one of the (co-)authors is a member of the editorial board of <italic>The Cryosphere</italic>. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e7448">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e7454">The original climate model simulations used in this project were carried out under the framework of the European Union's Horizon 2020 research project PROTECT under grant agreement 869304. We gratefully acknowledge the computing resources provided by the European Weather Cloud under special project dkmottr2. Claude Sonnet 4.5 was used for grammar check and to improve readability.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e7459">ES received support for this study from the National Centre for Climate Research (NCKF) and some additional support from the Novo Nordisk funded PRECISE project (grant no. NNF23OC0081251). SS received financial support from FWF project Weg_re (grant no. 10.55776/P35388).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e7465">This paper was edited by Andrew Orr and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Akiba et al.(2019)Akiba, Sano, Yanase, Ohta, and Koyama</label><mixed-citation>Akiba, T., Sano, S., Yanase, T., Ohta, T., and Koyama, M.: Optuna: A next-generation hyperparameter optimization framework, in: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery &amp; data mining,  2623–2631, <ext-link xlink:href="https://doi.org/10.1145/3292500.3330701" ext-link-type="DOI">10.1145/3292500.3330701</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Anilkumar et al.(2023)Anilkumar, Bharti, Chutia, and Aggarwal</label><mixed-citation>Anilkumar, R., Bharti, R., Chutia, D., and Aggarwal, S. P.: Modelling point mass balance for the glaciers of the Central European Alps using machine learning techniques, The Cryosphere, 17, 2811–2828, <ext-link xlink:href="https://doi.org/10.5194/tc-17-2811-2023" ext-link-type="DOI">10.5194/tc-17-2811-2023</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Auffarth(2021)</label><mixed-citation> Auffarth, B.: Machine learning for time-series with Python, Packt Publishing United Kingdom, ISBN: 9781801819626, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Bolibar et al.(2020)Bolibar, Rabatel, Gouttevin, Galiez, Condom, and Sauquet</label><mixed-citation>Bolibar, J., Rabatel, A., Gouttevin, I., Galiez, C., Condom, T., and Sauquet, E.: Deep learning applied to glacier evolution modelling, The Cryosphere, 14, 565–584, <ext-link xlink:href="https://doi.org/10.5194/tc-14-565-2020" ext-link-type="DOI">10.5194/tc-14-565-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Bolibar et al.(2022)Bolibar, Rabatel, Gouttevin, Zekollari, and Galiez</label><mixed-citation>Bolibar, J., Rabatel, A., Gouttevin, I., Zekollari, H., and Galiez, C.: Nonlinear sensitivity of glacier mass balance to future climate change unveiled by deep learning, Nat. Commun., 13, 409, <ext-link xlink:href="https://doi.org/10.1038/s41467-022-28033-0" ext-link-type="DOI">10.1038/s41467-022-28033-0</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Chen and Guestrin(2016)</label><mixed-citation>Chen, T. and Guestrin, C.: XGBoost: A Scalable Tree Boosting System, in: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '16, 785–794, Association for Computing Machinery, New York, NY, USA, San Francisco, California, USA, ISBN 9781450342322, <ext-link xlink:href="https://doi.org/10.1145/2939672.2939785" ext-link-type="DOI">10.1145/2939672.2939785</ext-link>,  2016.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Cipolla et al.(2018)Cipolla, Gal, and Kendall</label><mixed-citation>Cipolla, R., Gal, Y., and Kendall, A.: Multi-task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics, in: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition,   7482–7491, ISBN 2575-7075, <ext-link xlink:href="https://doi.org/10.1109/CVPR.2018.00781" ext-link-type="DOI">10.1109/CVPR.2018.00781</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>de Burgh-Day and Leeuwenburg(2023)</label><mixed-citation>de Burgh-Day, C. O. and Leeuwenburg, T.: Machine learning for numerical weather and climate modelling: a review, Geosci. Model Dev., 16, 6433–6477, <ext-link xlink:href="https://doi.org/10.5194/gmd-16-6433-2023" ext-link-type="DOI">10.5194/gmd-16-6433-2023</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>de Roda Husman et al.(2024)de Roda Husman, Hu, van Tiggelen, Dell, Bolibar, Lhermitte, Wouters, and Munneke</label><mixed-citation>de Roda Husman, S., Hu, Z., van Tiggelen, M., Dell, R., Bolibar, J., Lhermitte, S., Wouters, B., and Munneke, P. K.: Physically-informed super-resolution downscaling of Antarctic surface melt, Wiley Online Library, J. Adv. Model. Earth Syst., 16, e2023MS004212, <ext-link xlink:href="https://doi.org/10.1029/2023MS004212" ext-link-type="DOI">10.1029/2023MS004212</ext-link>,  2024.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Dubey et al.(2022)Dubey, Singh, and Chaudhuri</label><mixed-citation>Dubey, S. R., Singh, S. K., and Chaudhuri, B. B.: Activation functions in deep learning: A comprehensive survey and benchmark, Neurocomputing, 503, 92–108, <ext-link xlink:href="https://doi.org/10.1016/j.neucom.2022.06.111" ext-link-type="DOI">10.1016/j.neucom.2022.06.111</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Dunmire et al.(2024)Dunmire, Wever, Banwell, and Lenaerts</label><mixed-citation>Dunmire, D., Wever, N., Banwell, A. F., and Lenaerts, J. T. M.: Antarctic-wide ice-shelf firn emulation reveals robust future firn air depletion signal for the Antarctic Peninsula, Commun. Earth   Environ., 5, 100, <ext-link xlink:href="https://doi.org/10.1038/s43247-024-01255-4" ext-link-type="DOI">10.1038/s43247-024-01255-4</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Fettweis et al.(2013)</label><mixed-citation>Fettweis, X., Franco, B., Tedesco, M., van Angelen, J. H., Lenaerts, J. T. M., van den Broeke, M. R., and Gallée, H.: Estimating the Greenland ice sheet surface mass balance contribution to future sea level rise using the regional atmospheric climate model MAR, The Cryosphere, 7, 469–489, <ext-link xlink:href="https://doi.org/10.5194/tc-7-469-2013" ext-link-type="DOI">10.5194/tc-7-469-2013</ext-link>,   2013.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Fettweis et al.(2017)</label><mixed-citation>Fettweis, X., Box, J. E., Agosta, C., Amory, C., Kittel, C., Lang, C., van As, D., Machguth, H., and Gallée, H.: Reconstructions of the 1900–2015 Greenland ice sheet surface mass balance using the regional climate MAR model, The Cryosphere, 11, 1015–1033, <ext-link xlink:href="https://doi.org/10.5194/tc-11-1015-2017" ext-link-type="DOI">10.5194/tc-11-1015-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Fettweis et al.(2020)Fettweis, Hofer, Krebs-Kanzow, Amory, Aoki, Berends, Born, Box, Delhasse, Fujita, Gierz, Goelzer, Hanna, Hashimoto, Huybrechts, Kapsch, King, Kittel, Lang, Langen, Lenaerts, Liston, Lohmann, Mernild, Mikolajewicz, Modali, Mottram, Niwano, Noël, Ryan, Smith, Streffing, Tedesco, van de Berg, van den Broeke, van de Wal, van Kampenhout, Wilton, Wouters, Ziemen, and Zolles</label><mixed-citation>Fettweis, X., Hofer, S., Krebs-Kanzow, U., Amory, C., Aoki, T., Berends, C. J., Born, A., Box, J. E., Delhasse, A., Fujita, K., Gierz, P., Goelzer, H., Hanna, E., Hashimoto, A., Huybrechts, P., Kapsch, M.-L., King, M. D., Kittel, C., Lang, C., Langen, P. L., Lenaerts, J. T. M., Liston, G. E., Lohmann, G., Mernild, S. H., Mikolajewicz, U., Modali, K., Mottram, R. H., Niwano, M., NoÃ«l, B., Ryan, J. C., Smith, A., Streffing, J., Tedesco, M., van de Berg, W. J., van den Broeke, M., van de Wal, R. S. W., van Kampenhout, L., Wilton, D., Wouters, B., Ziemen, F., and Zolles, T.: GrSMBMIP: intercomparison of the modelled 1980–2012 surface mass balance over the Greenland Ice Sheet, The Cryosphere, 14, 3935–3958, <ext-link xlink:href="https://doi.org/10.5194/tc-14-3935-2020" ext-link-type="DOI">10.5194/tc-14-3935-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Flora et al.(2024)Flora, Potvin, McGovern, and Handler</label><mixed-citation>Flora, M. L., Potvin, C. K., McGovern, A., and Handler, S.: A machine learning explainability tutorial for atmospheric sciences, Artificial Intelligence for the Earth Systems, 3, e230018, <ext-link xlink:href="https://doi.org/10.1175/AIES-D-23-0018.1" ext-link-type="DOI">10.1175/AIES-D-23-0018.1</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Glaude et al.(2024)</label><mixed-citation>Glaude, Q., Noël, B., Olesen, M., Van den Broeke, M., van de Berg, W. J., Mottram, R., Hansen, N., Delhasse, A., Amory, C., and Kittel, C.: A factor two difference in 21st-century Greenland ice sheet surface mass balance projections from three regional climate models under a strong warming scenario (SSP5-8.5), Geophys. Res. Lett., 51, e2024GL111902, <ext-link xlink:href="https://doi.org/10.1029/2024GL111902" ext-link-type="DOI">10.1029/2024GL111902</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Goelzer et al.(2013)</label><mixed-citation>Goelzer, H., Huybrechts, P., Fürst, J., Nick, F., Andersen, M., Edwards, T., Fettweis, X., Payne, A., and Shannon, S.: Sensitivity of Greenland Ice Sheet Projections to Model Formulations, J. Glaciol., 59, 733–749, <ext-link xlink:href="https://doi.org/10.3189/2013JoG12J182" ext-link-type="DOI">10.3189/2013JoG12J182</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Goodfellow et al.(2016)Goodfellow, Bengio, and Courville</label><mixed-citation>Goodfellow, I., Bengio, Y., and Courville, A.: Deep Learning, MIT Press, <uri>https://www.deeplearningbook.org</uri> (last access: 28 May 2026), 2016.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Géron(2019)</label><mixed-citation> Géron, A.: Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow, O'Reilly Media, Inc., 2 Edn., ISBN 9781492032649, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Hadjipetrou(2026)</label><mixed-citation>Hadjipetrou, S.: A review of statistical methods for climate downscaling: the underexplored potential of geostatistical simulation, Theor. Appl. Climatol., 157, 216, <ext-link xlink:href="https://doi.org/10.1007/s00704-026-06120-2" ext-link-type="DOI">10.1007/s00704-026-06120-2</ext-link>, 2026.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Hochreiter and Schmidhuber(1997)</label><mixed-citation> Hochreiter, S. and Schmidhuber, J.: Long short-term memory, Neural Comput., 9, 1735–1780,   1997.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Hu et al.(2021)Hu, Kuipers Munneke, Lhermitte, Izeboud, and Van Den Broeke</label><mixed-citation>Hu, Z., Kuipers Munneke, P., Lhermitte, S., Izeboud, M., and Van Den Broeke, M.: Improving surface melt estimation over the Antarctic Ice Sheet using deep learning: a proof of concept over the Larsen Ice Shelf, The Cryosphere, 15, 5639–5658, <ext-link xlink:href="https://doi.org/10.5194/tc-15-5639-2021" ext-link-type="DOI">10.5194/tc-15-5639-2021</ext-link>,  2021.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Jeffries et al.(2014)Jeffries, Richter-Menge, and Overland</label><mixed-citation>Jeffries, M. O., Richter-Menge, J., and Overland, J. E.: Arctic report card 2014, Tech. rep., <uri>https://arctic.noaa.gov/report-card/report-card-archive/</uri> (last access: 28 May 2026), 2014.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Jiang et al.(2024)Jiang, Sweet, Blougouras, Brenning, Li, Reichstein, Denzler, Shangguan, Yu, Huang, and Zscheischler</label><mixed-citation>Jiang, S., Sweet, L.-b., Blougouras, G., Brenning, A., Li, W., Reichstein, M., Denzler, J., Shangguan, W., Yu, G., Huang, F., and Zscheischler, J.: How Interpretable Machine Learning Can Benefit Process Understanding in the Geosciences, Earth's Future, 12, e2024EF004540, <ext-link xlink:href="https://doi.org/10.1029/2024EF004540" ext-link-type="DOI">10.1029/2024EF004540</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Kingma and Ba(2014)</label><mixed-citation> Kingma, D. P. and Ba, J.: Adam: A method for stochastic optimization, arXiv preprint arXiv:1412.6980, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Langen et al.(2017)Langen, Fausto, Vandecrux, Mottram, and Box</label><mixed-citation>Langen, P. L., Fausto, R. S., Vandecrux, B., Mottram, R. H., and Box, J. E.: Liquid Water Flow and Retention on the Greenland Ice Sheet in the Regional Climate Model HIRHAM5: Local and Large-Scale Impacts, Front. Earth Sci., 4, <ext-link xlink:href="https://doi.org/10.3389/feart.2016.00110" ext-link-type="DOI">10.3389/feart.2016.00110</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Lenaerts et al.(2020)Lenaerts, Camron, Wyburn-Powell, and Kay</label><mixed-citation>Lenaerts, J., Camron, M. D., Wyburn-Powell, C. R., and Kay, J. E.: Present-day and future Greenland Ice Sheet precipitation frequency from CloudSat observations and the Community Earth System Model, The Cryosphere, 14, 2253–2265, <ext-link xlink:href="https://doi.org/10.5194/tc-14-2253-2020" ext-link-type="DOI">10.5194/tc-14-2253-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Lucas-Picher et al.(2012)</label><mixed-citation>Lucas-Picher, P., Wulff-Nielsen, M., Christensen, J. H., Aðalgeirsdóttir, G., Mottram, R., and Simonsen, S. B.: Very high resolution regional climate model simulations over Greenland: Identifying added value, J. Geophys. Res.-Atmos., 117, <ext-link xlink:href="https://doi.org/10.1029/2011JD016267" ext-link-type="DOI">10.1029/2011JD016267</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Lundberg and Lee(2017)</label><mixed-citation> Lundberg, S. M. and Lee, S.-I.: A unified approach to interpreting model predictions, in: Proceedings of the 31st International Conference on Neural Information Processing Systems,  4768–4777, Curran Associates Inc., Long Beach, California, USA, ISBN 9781510860964, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Mankin et al.(2020)Mankin, Lehner, Coats, and McKinnon</label><mixed-citation>Mankin, J. S., Lehner, F., Coats, S., and McKinnon, K. A.: The value of initial condition large ensembles to robust adaptation decision-making, Earth's Future, 8, <ext-link xlink:href="https://doi.org/10.1029/2020EF001610" ext-link-type="DOI">10.1029/2020EF001610</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Meredith et al.(2019)</label><mixed-citation>Meredith, M., Sommerkorn, M., Cassotta, S., Derksen, C., Ekaykin, A., Hollowed, A., Kofinas, G., Mackintosh, A., Melbourne-Thomas, J., Muelbert, M., Ottersen, G., Pritchard, H., and Schuur, E.: Polar Regions, in: IPCC Special Report on the Ocean and Cryosphere in a Changing Climate, edited by: Pörtner, H.-O., Roberts, D. C., Masson-Delmotte, V., Zhai, P., Tignor, M., Poloczanska, E., Mintenbeck, K., Alegría, A., Nicolai, M., Okem, A., Petzold, J., Rama, B., and Weyer, N. M.,  Tech. rep., Cambridge University Press, Cambride, UK and New York, NY, USA, <ext-link xlink:href="https://doi.org/10.1017/9781009157964.005" ext-link-type="DOI">10.1017/9781009157964.005</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Molina et al.(2023)</label><mixed-citation>Molina, M. J., O'Brien, T. A., Anderson, G., Ashfaq, M., Bennett, K. E., Collins, W. D., Dagon, K., Restrepo, J. M., and Ullrich, P. A.: A review of recent and emerging machine learning applications for climate variability and weather phenomena, Artificial Intelligence for the Earth Systems, 2, 220086, <ext-link xlink:href="https://doi.org/10.1175/AIES-D-22-0086.1" ext-link-type="DOI">10.1175/AIES-D-22-0086.1</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Molnar et al.(2022)Molnar, KÃ¶nig, Herbinger, Freiesleben, Dandl, Scholbeck, Casalicchio, Grosse-Wentrup, and Bischl</label><mixed-citation>Molnar, C., KÃ¶nig, G., Herbinger, J., Freiesleben, T., Dandl, S., Scholbeck, C. A., Casalicchio, G., Grosse-Wentrup, M., and Bischl, B.: General Pitfalls of Model-Agnostic Interpretation Methods for Machine Learning Models, in: xxAI – Beyond Explainable AI: International Workshop, Held in Conjunction with ICML 2020, July 18, 2020, Vienna, Austria, Revised and Extended Papers, edited by: Holzinger, A., Goebel, R., Fong, R., Moon, T., MÃ¼ller, K.-R., and Samek, W., 39–68, Springer International Publishing, Cham, ISBN 978-3-031-04083-2, <ext-link xlink:href="https://doi.org/10.1007/978-3-031-04083-2_4" ext-link-type="DOI">10.1007/978-3-031-04083-2_4</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Mottram et al.(2017)Mottram, Boberg, Langen, Yang, Rodehacke, Christensen, and Madsen</label><mixed-citation>Mottram, R., Boberg, F., Langen, P., Yang, S., Rodehacke, C., Christensen, J. H., and Madsen, M. S.: Surface mass balance of the Greenland ice sheet in the regional climate model HIRHAM5: Present state and future prospects, Low Temperature Science, 75, 105–115, <ext-link xlink:href="https://doi.org/10.14943/lowtemsci.75.105" ext-link-type="DOI">10.14943/lowtemsci.75.105</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Noël et al.(2018)</label><mixed-citation>Noël, B., van de Berg, W. J., van Wessem, J. M., van Meijgaard, E., van As, D., Lenaerts, J. T. M., Lhermitte, S., Kuipers Munneke, P., Smeets, C. J. P. P., van Ulft, L. H., van de Wal, R. S. W., and van den Broeke, M. R.: Modelling the climate and surface mass balance of polar ice sheets using RACMO2 – Part 1: Greenland (1958–2016), The Cryosphere, 12, 811–831, <ext-link xlink:href="https://doi.org/10.5194/tc-12-811-2018" ext-link-type="DOI">10.5194/tc-12-811-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Ogunmolasuyi et al.(2025)Ogunmolasuyi, Meyer, McDowell, Thompson-Munson, and Baker</label><mixed-citation>Ogunmolasuyi, A., Meyer, C. R., McDowell, I., Thompson-Munson, M., and Baker, I.: FirnLearn: A neural network-based approach to firn density modeling in Antarctica, J. Glaciol., 71, e71, <ext-link xlink:href="https://doi.org/10.1017/jog.2025.26" ext-link-type="DOI">10.1017/jog.2025.26</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Pan et al.(2025)Pan, Chen, Pan, Huang, Yang, Piao, Zhou, Dai, Chen, and Li</label><mixed-citation>Pan, X., Chen, D., Pan, B., Huang, X., Yang, K., Piao, S., Zhou, T., Dai, Y., Chen, F., and Li, X.: Evolution and prospects of Earth system models: Challenges and opportunities, Earth-Sci. Rev., 260, 104986, <ext-link xlink:href="https://doi.org/10.1016/j.earscirev.2024.104986" ext-link-type="DOI">10.1016/j.earscirev.2024.104986</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Reichstein et al.(2019)Reichstein, Camps-Valls, Stevens, Jung, Denzler, Carvalhais, and Prabhat</label><mixed-citation>Reichstein, M., Camps-Valls, G., Stevens, B., Jung, M., Denzler, J., Carvalhais, N., and Prabhat: Deep learning and process understanding for data-driven Earth system science, Nature, 566, 195–204, <ext-link xlink:href="https://doi.org/10.1038/s41586-019-0912-1" ext-link-type="DOI">10.1038/s41586-019-0912-1</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Richter-Menge et al.(2016)Richter-Menge, Overland, and Mathis</label><mixed-citation>Richter-Menge, J., Overland, J. E., and Mathis, J. T.: Arctic Report Card 2016: persistent warming trend and loss of sea ice are triggering extensive Arctic changes, NOAA, <uri>https://arctic.noaa.gov/report-card/report-card-archive/</uri> (last access: 28 May 2026), 2016.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Roberts et al.(2017)Roberts, Bahn, Ciuti, Boyce, Elith, Guillera-Arroita, Hauenstein, Lahoz-Monfort, SchrÃ¶der, Thuiller, Warton, Wintle, Hartig, and Dormann</label><mixed-citation>Roberts, D. R., Bahn, V., Ciuti, S., Boyce, M. S., Elith, J., Guillera-Arroita, G., Hauenstein, S., Lahoz-Monfort, J. J., SchrÃ¶der, B., Thuiller, W., Warton, D. I., Wintle, B. A., Hartig, F., and Dormann, C. F.: Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure, Ecography, 40, 913–929, <ext-link xlink:href="https://doi.org/10.1111/ecog.02881" ext-link-type="DOI">10.1111/ecog.02881</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Sadler et al.(2022)Sadler, Appling, Read, Oliver, Jia, Zwart, and Kumar</label><mixed-citation>Sadler, J. M., Appling, A. P., Read, J. S., Oliver, S. K., Jia, X., Zwart, J. A., and Kumar, V.: Multi-Task Deep Learning of Daily Streamflow and Water Temperature, Water Resour. Res., 58, e2021WR030138, <ext-link xlink:href="https://doi.org/10.1029/2021WR030138" ext-link-type="DOI">10.1029/2021WR030138</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Schlager(2026a)</label><mixed-citation>Schlager, E.: Code of Learning to melt: Emulating Greenland surface melt from a polar RCM with machine learning, Zenodo, <ext-link xlink:href="https://doi.org/10.5281/zenodo.20271069" ext-link-type="DOI">10.5281/zenodo.20271069</ext-link>, 2026a.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Schlager(2026b)</label><mixed-citation>Schlager, E.: Output of Learning to melt: Emulating Greenland surface melt from a polar RCM with machine learning,  Zenodo, <ext-link xlink:href="https://doi.org/10.5281/zenodo.19627367" ext-link-type="DOI">10.5281/zenodo.19627367</ext-link>, 2026b.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Sellevold and Vizcaino(2021)</label><mixed-citation>Sellevold, R. and Vizcaino, M.: First Application of Artificial Neural Networks to Estimate 21st Century Greenland Ice Sheet Surface Melt, Geophys. Res. Lett., 48, <ext-link xlink:href="https://doi.org/10.1029/2021GL092449" ext-link-type="DOI">10.1029/2021GL092449</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Sun et al.(2024)Sun, Deng, Ren, Liu, Deng, and Jin</label><mixed-citation>Sun, Y., Deng, K., Ren, K., Liu, J., Deng, C., and Jin, Y.: Deep learning in statistical downscaling for deriving high spatial resolution gridded meteorological data: A systematic review, ISPRS J. Photogramm., 208, 14–38, <ext-link xlink:href="https://doi.org/10.1016/j.isprsjprs.2023.12.011" ext-link-type="DOI">10.1016/j.isprsjprs.2023.12.011</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Tebaldi et al.(2025)Tebaldi, Selin, Ferrari, and Flierl</label><mixed-citation>Tebaldi, C., Selin, N. E., Ferrari, R., and Flierl, G.: Emulators of climate model output, Ann. Rev. Environ. Res., 50, <ext-link xlink:href="https://doi.org/10.1146/annurev-environ-012125-085838" ext-link-type="DOI">10.1146/annurev-environ-012125-085838</ext-link>,  2025.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>The IMBIE Team(2020)</label><mixed-citation>The IMBIE Team: Mass balance of the Greenland Ice Sheet from 1992 to 2018, Nature, 579, 233–239, <ext-link xlink:href="https://doi.org/10.1038/s41586-019-1855-2" ext-link-type="DOI">10.1038/s41586-019-1855-2</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Theng and Bhoyar(2024)</label><mixed-citation>Theng, D. and Bhoyar, K. K.: Feature selection techniques for machine learning: a survey of more than two decades of research, Know. Inf. Syst., 66, 1575–1637, <ext-link xlink:href="https://doi.org/10.1007/s10115-023-02010-5" ext-link-type="DOI">10.1007/s10115-023-02010-5</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Tyralis et al.(2021)Tyralis, Papacharalampous, and Langousis</label><mixed-citation>Tyralis, H., Papacharalampous, G., and Langousis, A.: Super ensemble learning for daily streamflow forecasting: Large-scale demonstration and comparison with multiple machine learning algorithms, Neural Comput. Appl., 33, 3053–3068, <ext-link xlink:href="https://doi.org/10.1007/s00521-020-05172-3" ext-link-type="DOI">10.1007/s00521-020-05172-3</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>van der Meer et al.(2023)van der Meer, De Roda Husman, and Lhermitte</label><mixed-citation>van der Meer, M., De Roda Husman, S., and Lhermitte, S.: Deep Learning Regional Climate Model Emulators: A Comparison of Two Downscaling Training Frameworks, J. Adv. Model. Earth Syst., 15, e2022MS003593, <ext-link xlink:href="https://doi.org/10.1029/2022MS003593" ext-link-type="DOI">10.1029/2022MS003593</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Vandecrux et al.(2019)</label><mixed-citation>Vandecrux, B., MacFerrin, M., Machguth, H., Colgan, W. T., van As, D., Heilig, A., Stevens, C. M., Charalampidis, C., Fausto, R. S., Morris, E. M., Mosley-Thompson, E., Koenig, L., Montgomery, L. N., Miège, C., Simonsen, S. B., Ingeman-Nielsen, T., and Box, J. E.: Firn data compilation reveals widespread decrease of firn air content in western Greenland, The Cryosphere, 13, 845–859, <ext-link xlink:href="https://doi.org/10.5194/tc-13-845-2019" ext-link-type="DOI">10.5194/tc-13-845-2019</ext-link>, 2019. </mixed-citation></ref>
      <ref id="bib1.bibx52"><label>Vandecrux et al.(2024)</label><mixed-citation>Vandecrux, B., Fausto, R. S., Box, J. E., Covi, F., Hock, R., Rennermalm, Ã. K., Heilig, A., Abermann, J., van As, D., Bjerre, E., Fettweis, X., Smeets, P. C. J. P., Kuipers Munneke, P., van den Broeke, M. R., Brils, M., Langen, P. L., Mottram, R., and Ahlström, A. P.: Recent warming trends of the Greenland ice sheet documented by historical firn and ice temperature observations and machine learning, The Cryosphere, 18, 609–631, <ext-link xlink:href="https://doi.org/10.5194/tc-18-609-2024" ext-link-type="DOI">10.5194/tc-18-609-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>Vaswani et al.(2017)Vaswani, Shazeer, Parmar, Uszkoreit, Jones, Gomez, Kaiser, and Polosukhin</label><mixed-citation>Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Å., and Polosukhin, I.: Attention is all you need, Adv. Neur. In., 30, <uri>https://proceedings.neurips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html</uri> (last access: 28 May 2026), 2017.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Veldhuijsen et al.(2025)Veldhuijsen, van de Berg, Kuipers Munneke, Hansen, Boberg, Kittel, Amory, and van den Broeke</label><mixed-citation>Veldhuijsen, S. B. M., van de Berg, W. J., Kuipers Munneke, P., Hansen, N., Boberg, F., Kittel, C., Amory, C., and van den Broeke, M. R.: Emulating the expansion of Antarctic perennial firn aquifers in the 21st century, The Cryosphere, 19, 5157–5173, <ext-link xlink:href="https://doi.org/10.5194/tc-19-5157-2025" ext-link-type="DOI">10.5194/tc-19-5157-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>Wang et al.(2021)Wang, Zender, van As, Fausto, and Laffin</label><mixed-citation>Wang, W., Zender, C. S., van As, D., Fausto, R. S., and Laffin, M. K.: Greenland Surface Melt Dominated by Solar and Sensible Heating, Geophys. Res. Lett., 48, e2020GL090653, <ext-link xlink:href="https://doi.org/10.1029/2020GL090653" ext-link-type="DOI">10.1029/2020GL090653</ext-link>,  2021.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Webber(2013)</label><mixed-citation>Webber, J. B. W.: A bi-symmetric log transformation for wide-range data, Meas. Sci. Technol., 24, 027001, <ext-link xlink:href="https://doi.org/10.1088/0957-0233/24/2/027001" ext-link-type="DOI">10.1088/0957-0233/24/2/027001</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx57"><label>Wesselkamp et al.(2025)</label><mixed-citation>Wesselkamp, M., Chantry, M., Pinnington, E., Choulga, M., Boussetta, S., Kalweit, M., Bödecker, J., Dormann, C. F., Pappenberger, F., and Balsamo, G.: Advances in land surface forecasting: a comparison of LSTM, gradient boosting, and feed-forward neural networks as prognostic state emulators in a case study with ecLand, Geosci. Model Dev., 18, 921–937, <ext-link xlink:href="https://doi.org/10.5194/gmd-18-921-2025" ext-link-type="DOI">10.5194/gmd-18-921-2025</ext-link>, 2025.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Learning to melt: Emulating Greenland surface melt from a polar RCM with machine learning</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Akiba et al.(2019)Akiba, Sano, Yanase, Ohta, and
Koyama</label><mixed-citation>
      
Akiba, T., Sano, S., Yanase, T., Ohta, T., and Koyama, M.: Optuna: A
next-generation hyperparameter optimization framework, in: Proceedings of the
25th ACM SIGKDD international conference on knowledge discovery &amp; data
mining,  2623–2631, <a href="https://doi.org/10.1145/3292500.3330701" target="_blank">https://doi.org/10.1145/3292500.3330701</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Anilkumar et al.(2023)Anilkumar, Bharti, Chutia, and
Aggarwal</label><mixed-citation>
      
Anilkumar, R., Bharti, R., Chutia, D., and Aggarwal, S. P.: Modelling point
mass balance for the glaciers of the Central European Alps using
machine learning techniques, The Cryosphere, 17, 2811–2828,
<a href="https://doi.org/10.5194/tc-17-2811-2023" target="_blank">https://doi.org/10.5194/tc-17-2811-2023</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Auffarth(2021)</label><mixed-citation>
      
Auffarth, B.: Machine learning for time-series with Python, Packt Publishing
United Kingdom, ISBN: 9781801819626, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Bolibar et al.(2020)Bolibar, Rabatel, Gouttevin, Galiez, Condom, and
Sauquet</label><mixed-citation>
      
Bolibar, J., Rabatel, A., Gouttevin, I., Galiez, C., Condom, T., and Sauquet,
E.: Deep learning applied to glacier evolution modelling, The Cryosphere, 14,
565–584, <a href="https://doi.org/10.5194/tc-14-565-2020" target="_blank">https://doi.org/10.5194/tc-14-565-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Bolibar et al.(2022)Bolibar, Rabatel, Gouttevin, Zekollari, and
Galiez</label><mixed-citation>
      
Bolibar, J., Rabatel, A., Gouttevin, I., Zekollari, H., and Galiez, C.:
Nonlinear sensitivity of glacier mass balance to future climate change
unveiled by deep learning, Nat. Commun., 13, 409,
<a href="https://doi.org/10.1038/s41467-022-28033-0" target="_blank">https://doi.org/10.1038/s41467-022-28033-0</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Chen and Guestrin(2016)</label><mixed-citation>
      
Chen, T. and Guestrin, C.: XGBoost: A Scalable Tree Boosting
System, in: Proceedings of the 22nd ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining, KDD '16,
785–794, Association for Computing Machinery, New York, NY, USA, San Francisco,
California, USA, ISBN
9781450342322, <a href="https://doi.org/10.1145/2939672.2939785" target="_blank">https://doi.org/10.1145/2939672.2939785</a>,  2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Cipolla et al.(2018)Cipolla, Gal, and
Kendall</label><mixed-citation>
      
Cipolla, R., Gal, Y., and Kendall, A.: Multi-task Learning Using
Uncertainty to Weigh Losses for Scene Geometry and Semantics, in:
2018 IEEE/CVF Conference on Computer Vision and Pattern
Recognition,   7482–7491, ISBN 2575-7075, <a href="https://doi.org/10.1109/CVPR.2018.00781" target="_blank">https://doi.org/10.1109/CVPR.2018.00781</a>,
2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>de Burgh-Day and Leeuwenburg(2023)</label><mixed-citation>
      
de Burgh-Day, C. O. and Leeuwenburg, T.: Machine learning for numerical weather
and climate modelling: a review, Geosci. Model Dev., 16,
6433–6477, <a href="https://doi.org/10.5194/gmd-16-6433-2023" target="_blank">https://doi.org/10.5194/gmd-16-6433-2023</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>de Roda Husman et al.(2024)de Roda Husman, Hu, van Tiggelen, Dell,
Bolibar, Lhermitte, Wouters, and
Munneke</label><mixed-citation>
      
de Roda Husman, S., Hu, Z., van Tiggelen, M., Dell, R., Bolibar, J., Lhermitte,
S., Wouters, B., and Munneke, P. K.: Physically-informed super-resolution
downscaling of Antarctic surface melt, Wiley Online
Library, J. Adv. Model.
Earth Syst., 16, e2023MS004212, <a href="https://doi.org/10.1029/2023MS004212" target="_blank">https://doi.org/10.1029/2023MS004212</a>,  2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Dubey et al.(2022)Dubey, Singh, and
Chaudhuri</label><mixed-citation>
      
Dubey, S. R., Singh, S. K., and Chaudhuri, B. B.: Activation functions in deep
learning: A comprehensive survey and benchmark, Neurocomputing, 503,
92–108, <a href="https://doi.org/10.1016/j.neucom.2022.06.111" target="_blank">https://doi.org/10.1016/j.neucom.2022.06.111</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Dunmire et al.(2024)Dunmire, Wever, Banwell, and
Lenaerts</label><mixed-citation>
      
Dunmire, D., Wever, N., Banwell, A. F., and Lenaerts, J. T. M.: Antarctic-wide
ice-shelf firn emulation reveals robust future firn air depletion signal for
the Antarctic Peninsula, Commun. Earth   Environ., 5, 100,
<a href="https://doi.org/10.1038/s43247-024-01255-4" target="_blank">https://doi.org/10.1038/s43247-024-01255-4</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Fettweis et al.(2013)</label><mixed-citation>
      
Fettweis, X., Franco, B., Tedesco, M., van Angelen, J. H., Lenaerts, J. T. M.,
van den Broeke, M. R., and Gallée, H.: Estimating the Greenland ice sheet
surface mass balance contribution to future sea level rise using the regional
atmospheric climate model MAR, The Cryosphere, 7, 469–489,
<a href="https://doi.org/10.5194/tc-7-469-2013" target="_blank">https://doi.org/10.5194/tc-7-469-2013</a>,   2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Fettweis et al.(2017)</label><mixed-citation>
      
Fettweis, X., Box, J. E., Agosta, C., Amory, C., Kittel, C., Lang, C., van As,
D., Machguth, H., and Gallée, H.: Reconstructions of the 1900–2015
Greenland ice sheet surface mass balance using the regional climate MAR
model, The Cryosphere, 11, 1015–1033, <a href="https://doi.org/10.5194/tc-11-1015-2017" target="_blank">https://doi.org/10.5194/tc-11-1015-2017</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Fettweis et al.(2020)Fettweis, Hofer, Krebs-Kanzow, Amory, Aoki,
Berends, Born, Box, Delhasse, Fujita, Gierz, Goelzer, Hanna, Hashimoto,
Huybrechts, Kapsch, King, Kittel, Lang, Langen, Lenaerts, Liston, Lohmann,
Mernild, Mikolajewicz, Modali, Mottram, Niwano, Noël, Ryan, Smith,
Streffing, Tedesco, van de Berg, van den Broeke, van de Wal, van Kampenhout,
Wilton, Wouters, Ziemen, and Zolles</label><mixed-citation>
      
Fettweis, X., Hofer, S., Krebs-Kanzow, U., Amory, C., Aoki, T., Berends, C. J.,
Born, A., Box, J. E., Delhasse, A., Fujita, K., Gierz, P., Goelzer, H.,
Hanna, E., Hashimoto, A., Huybrechts, P., Kapsch, M.-L., King, M. D., Kittel,
C., Lang, C., Langen, P. L., Lenaerts, J. T. M., Liston, G. E., Lohmann, G.,
Mernild, S. H., Mikolajewicz, U., Modali, K., Mottram, R. H., Niwano, M.,
NoÃ«l, B., Ryan, J. C., Smith, A., Streffing, J., Tedesco, M., van de Berg,
W. J., van den Broeke, M., van de Wal, R. S. W., van Kampenhout, L., Wilton,
D., Wouters, B., Ziemen, F., and Zolles, T.: GrSMBMIP: intercomparison of
the modelled 1980–2012 surface mass balance over the Greenland Ice
Sheet, The Cryosphere, 14, 3935–3958, <a href="https://doi.org/10.5194/tc-14-3935-2020" target="_blank">https://doi.org/10.5194/tc-14-3935-2020</a>,
2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Flora et al.(2024)Flora, Potvin, McGovern, and
Handler</label><mixed-citation>
      
Flora, M. L., Potvin, C. K., McGovern, A., and Handler, S.: A machine learning
explainability tutorial for atmospheric sciences, Artificial Intelligence for
the Earth Systems, 3, e230018, <a href="https://doi.org/10.1175/AIES-D-23-0018.1" target="_blank">https://doi.org/10.1175/AIES-D-23-0018.1</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Glaude et al.(2024)</label><mixed-citation>
      
Glaude, Q., Noël, B., Olesen, M., Van den Broeke, M., van de Berg, W. J.,
Mottram, R., Hansen, N., Delhasse, A., Amory, C., and Kittel, C.: A factor
two difference in 21st-century Greenland ice sheet surface mass balance
projections from three regional climate models under a strong warming
scenario (SSP5-8.5), Geophys. Res. Lett., 51, e2024GL111902,
<a href="https://doi.org/10.1029/2024GL111902" target="_blank">https://doi.org/10.1029/2024GL111902</a>,
2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Goelzer et al.(2013)</label><mixed-citation>
      
Goelzer, H., Huybrechts, P., Fürst, J., Nick, F., Andersen, M., Edwards, T.,
Fettweis, X., Payne, A., and Shannon, S.: Sensitivity of Greenland Ice
Sheet Projections to Model Formulations, J. Glaciol., 59,
733–749, <a href="https://doi.org/10.3189/2013JoG12J182" target="_blank">https://doi.org/10.3189/2013JoG12J182</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Goodfellow et al.(2016)Goodfellow, Bengio, and
Courville</label><mixed-citation>
      
Goodfellow, I., Bengio, Y., and Courville, A.: Deep Learning, MIT Press,
<a href="https://www.deeplearningbook.org" target="_blank"/> (last access: 28 May 2026), 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Géron(2019)</label><mixed-citation>
      
Géron, A.: Hands-on machine learning with Scikit-Learn, Keras, and
TensorFlow, O'Reilly Media, Inc., 2 Edn., ISBN 9781492032649, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Hadjipetrou(2026)</label><mixed-citation>
      
Hadjipetrou, S.: A review of statistical methods for climate downscaling: the
underexplored potential of geostatistical simulation, Theor. Appl.
Climatol., 157, 216, <a href="https://doi.org/10.1007/s00704-026-06120-2" target="_blank">https://doi.org/10.1007/s00704-026-06120-2</a>, 2026.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Hochreiter and Schmidhuber(1997)</label><mixed-citation>
      
Hochreiter, S. and Schmidhuber, J.: Long short-term memory, Neural Comput.,
9, 1735–1780,   1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Hu et al.(2021)Hu, Kuipers Munneke, Lhermitte, Izeboud, and Van
Den Broeke</label><mixed-citation>
      
Hu, Z., Kuipers Munneke, P., Lhermitte, S., Izeboud, M., and Van Den Broeke,
M.: Improving surface melt estimation over the Antarctic Ice Sheet
using deep learning: a proof of concept over the Larsen Ice Shelf, The
Cryosphere, 15, 5639–5658, <a href="https://doi.org/10.5194/tc-15-5639-2021" target="_blank">https://doi.org/10.5194/tc-15-5639-2021</a>,  2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Jeffries et al.(2014)Jeffries, Richter-Menge, and
Overland</label><mixed-citation>
      
Jeffries, M. O., Richter-Menge, J., and Overland, J. E.: Arctic report card
2014, Tech. rep.,
<a href="https://arctic.noaa.gov/report-card/report-card-archive/" target="_blank"/> (last access: 28 May 2026),
2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Jiang et al.(2024)Jiang, Sweet, Blougouras, Brenning, Li, Reichstein,
Denzler, Shangguan, Yu, Huang, and Zscheischler</label><mixed-citation>
      
Jiang, S., Sweet, L.-b., Blougouras, G., Brenning, A., Li, W., Reichstein, M.,
Denzler, J., Shangguan, W., Yu, G., Huang, F., and Zscheischler, J.: How
Interpretable Machine Learning Can Benefit Process
Understanding in the Geosciences, Earth's Future, 12, e2024EF004540,
<a href="https://doi.org/10.1029/2024EF004540" target="_blank">https://doi.org/10.1029/2024EF004540</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Kingma and Ba(2014)</label><mixed-citation>
      
Kingma, D. P. and Ba, J.: Adam: A method for stochastic optimization, arXiv
preprint arXiv:1412.6980, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Langen et al.(2017)Langen, Fausto, Vandecrux, Mottram, and
Box</label><mixed-citation>
      
Langen, P. L., Fausto, R. S., Vandecrux, B., Mottram, R. H., and Box, J. E.:
Liquid Water Flow and Retention on the Greenland Ice Sheet in the
Regional Climate Model HIRHAM5: Local and Large-Scale
Impacts, Front. Earth Sci., 4,
<a href="https://doi.org/10.3389/feart.2016.00110" target="_blank">https://doi.org/10.3389/feart.2016.00110</a>,
2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Lenaerts et al.(2020)Lenaerts, Camron, Wyburn-Powell, and
Kay</label><mixed-citation>
      
Lenaerts, J., Camron, M. D., Wyburn-Powell, C. R., and Kay, J. E.: Present-day
and future Greenland Ice Sheet precipitation frequency from CloudSat
observations and the Community Earth System Model, The Cryosphere,
14, 2253–2265, <a href="https://doi.org/10.5194/tc-14-2253-2020" target="_blank">https://doi.org/10.5194/tc-14-2253-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Lucas-Picher et al.(2012)</label><mixed-citation>
      
Lucas-Picher, P., Wulff-Nielsen, M., Christensen, J. H., Aðalgeirsdóttir,
G., Mottram, R., and Simonsen, S. B.: Very high resolution regional climate
model simulations over Greenland: Identifying added value, J.
Geophys. Res.-Atmos., 117, <a href="https://doi.org/10.1029/2011JD016267" target="_blank">https://doi.org/10.1029/2011JD016267</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Lundberg and Lee(2017)</label><mixed-citation>
      
Lundberg, S. M. and Lee, S.-I.: A unified approach to interpreting model
predictions, in: Proceedings of the 31st International Conference on
Neural Information Processing Systems,  4768–4777, Curran
Associates Inc., Long Beach, California, USA, ISBN 9781510860964, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Mankin et al.(2020)Mankin, Lehner, Coats, and
McKinnon</label><mixed-citation>
      
Mankin, J. S., Lehner, F., Coats, S., and McKinnon, K. A.: The value of initial
condition large ensembles to robust adaptation decision-making, Earth's
Future, 8, <a href="https://doi.org/10.1029/2020EF001610" target="_blank">https://doi.org/10.1029/2020EF001610</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Meredith et al.(2019)</label><mixed-citation>
      
Meredith, M., Sommerkorn, M., Cassotta, S., Derksen, C., Ekaykin, A., Hollowed,
A., Kofinas, G., Mackintosh, A., Melbourne-Thomas, J., Muelbert, M.,
Ottersen, G., Pritchard, H., and Schuur, E.: Polar Regions, in: IPCC
Special Report on the Ocean and Cryosphere in a Changing Climate, edited by:
Pörtner, H.-O., Roberts, D. C., Masson-Delmotte, V.,
Zhai, P., Tignor, M., Poloczanska, E., Mintenbeck, K.,
Alegría, A., Nicolai, M., Okem, A., Petzold, J., Rama, B.,
and Weyer, N. M.,  Tech. rep., Cambridge University Press, Cambride,
UK and New York, NY, USA,
<a href="https://doi.org/10.1017/9781009157964.005" target="_blank">https://doi.org/10.1017/9781009157964.005</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Molina et al.(2023)</label><mixed-citation>
      
Molina, M. J., O'Brien, T. A., Anderson, G., Ashfaq, M., Bennett, K. E.,
Collins, W. D., Dagon, K., Restrepo, J. M., and Ullrich, P. A.: A review of
recent and emerging machine learning applications for climate variability and
weather phenomena, Artificial Intelligence for the Earth Systems, 2,
220086, <a href="https://doi.org/10.1175/AIES-D-22-0086.1" target="_blank">https://doi.org/10.1175/AIES-D-22-0086.1</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Molnar et al.(2022)Molnar, KÃ¶nig, Herbinger, Freiesleben, Dandl,
Scholbeck, Casalicchio, Grosse-Wentrup, and Bischl</label><mixed-citation>
      
Molnar, C., KÃ¶nig, G., Herbinger, J., Freiesleben, T., Dandl, S., Scholbeck,
C. A., Casalicchio, G., Grosse-Wentrup, M., and Bischl, B.: General
Pitfalls of Model-Agnostic Interpretation Methods for Machine
Learning Models, in: xxAI – Beyond Explainable AI:
International Workshop, Held in Conjunction with ICML 2020, July
18, 2020, Vienna, Austria, Revised and Extended Papers, edited by:
Holzinger, A., Goebel, R., Fong, R., Moon, T., MÃ¼ller, K.-R., and Samek, W.,
39–68, Springer International Publishing, Cham, ISBN 978-3-031-04083-2,
<a href="https://doi.org/10.1007/978-3-031-04083-2_4" target="_blank">https://doi.org/10.1007/978-3-031-04083-2_4</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Mottram et al.(2017)Mottram, Boberg, Langen, Yang, Rodehacke,
Christensen, and Madsen</label><mixed-citation>
      
Mottram, R., Boberg, F., Langen, P., Yang, S., Rodehacke, C., Christensen,
J. H., and Madsen, M. S.: Surface mass balance of the Greenland ice sheet
in the regional climate model HIRHAM5: Present state and future
prospects, Low Temperature Science, 75, 105–115,
<a href="https://doi.org/10.14943/lowtemsci.75.105" target="_blank">https://doi.org/10.14943/lowtemsci.75.105</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Noël et al.(2018)</label><mixed-citation>
      
Noël, B., van de Berg, W. J., van Wessem, J. M., van Meijgaard, E., van As, D., Lenaerts, J. T. M., Lhermitte, S., Kuipers Munneke, P., Smeets, C. J. P. P., van Ulft, L. H., van de Wal, R. S. W., and van den Broeke, M. R.: Modelling the climate and surface mass balance of polar ice sheets using RACMO2 – Part 1: Greenland (1958–2016), The Cryosphere, 12, 811–831, <a href="https://doi.org/10.5194/tc-12-811-2018" target="_blank">https://doi.org/10.5194/tc-12-811-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Ogunmolasuyi et al.(2025)Ogunmolasuyi, Meyer, McDowell,
Thompson-Munson, and Baker</label><mixed-citation>
      
Ogunmolasuyi, A., Meyer, C. R., McDowell, I., Thompson-Munson, M., and Baker,
I.: FirnLearn: A neural network-based approach to firn density modeling
in Antarctica, J. Glaciol., 71, e71, <a href="https://doi.org/10.1017/jog.2025.26" target="_blank">https://doi.org/10.1017/jog.2025.26</a>,
2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Pan et al.(2025)Pan, Chen, Pan, Huang, Yang, Piao, Zhou, Dai, Chen,
and Li</label><mixed-citation>
      
Pan, X., Chen, D., Pan, B., Huang, X., Yang, K., Piao, S., Zhou, T., Dai, Y.,
Chen, F., and Li, X.: Evolution and prospects of Earth system models:
Challenges and opportunities, Earth-Sci. Rev., 260, 104986,
<a href="https://doi.org/10.1016/j.earscirev.2024.104986" target="_blank">https://doi.org/10.1016/j.earscirev.2024.104986</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Reichstein et al.(2019)Reichstein, Camps-Valls, Stevens, Jung,
Denzler, Carvalhais, and Prabhat</label><mixed-citation>
      
Reichstein, M., Camps-Valls, G., Stevens, B., Jung, M., Denzler, J.,
Carvalhais, N., and Prabhat: Deep learning and process understanding for
data-driven Earth system science, Nature, 566, 195–204,
<a href="https://doi.org/10.1038/s41586-019-0912-1" target="_blank">https://doi.org/10.1038/s41586-019-0912-1</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Richter-Menge et al.(2016)Richter-Menge, Overland, and
Mathis</label><mixed-citation>
      
Richter-Menge, J., Overland, J. E., and Mathis, J. T.: Arctic Report Card
2016: persistent warming trend and loss of sea ice are triggering extensive
Arctic changes, NOAA,
<a href="https://arctic.noaa.gov/report-card/report-card-archive/" target="_blank"/> (last access: 28 May 2026),
2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Roberts et al.(2017)Roberts, Bahn, Ciuti, Boyce, Elith,
Guillera-Arroita, Hauenstein, Lahoz-Monfort, SchrÃ¶der, Thuiller, Warton,
Wintle, Hartig, and Dormann</label><mixed-citation>
      
Roberts, D. R., Bahn, V., Ciuti, S., Boyce, M. S., Elith, J., Guillera-Arroita,
G., Hauenstein, S., Lahoz-Monfort, J. J., SchrÃ¶der, B., Thuiller, W.,
Warton, D. I., Wintle, B. A., Hartig, F., and Dormann, C. F.:
Cross-validation strategies for data with temporal, spatial, hierarchical, or
phylogenetic structure, Ecography, 40, 913–929,
<a href="https://doi.org/10.1111/ecog.02881" target="_blank">https://doi.org/10.1111/ecog.02881</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Sadler et al.(2022)Sadler, Appling, Read, Oliver, Jia, Zwart, and
Kumar</label><mixed-citation>
      
Sadler, J. M., Appling, A. P., Read, J. S., Oliver, S. K., Jia, X., Zwart,
J. A., and Kumar, V.: Multi-Task Deep Learning of Daily Streamflow
and Water Temperature, Water Resour. Res., 58, e2021WR030138,
<a href="https://doi.org/10.1029/2021WR030138" target="_blank">https://doi.org/10.1029/2021WR030138</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Schlager(2026a)</label><mixed-citation>
      
Schlager, E.: Code of Learning to melt: Emulating Greenland surface melt
from a polar RCM with machine learning, Zenodo, <a href="https://doi.org/10.5281/zenodo.20271069" target="_blank">https://doi.org/10.5281/zenodo.20271069</a>,
2026a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Schlager(2026b)</label><mixed-citation>
      
Schlager, E.: Output of Learning to melt: Emulating Greenland surface
melt from a polar RCM with machine learning,  Zenodo, <a href="https://doi.org/10.5281/zenodo.19627367" target="_blank">https://doi.org/10.5281/zenodo.19627367</a>,
2026b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Sellevold and Vizcaino(2021)</label><mixed-citation>
      
Sellevold, R. and Vizcaino, M.: First Application of Artificial Neural
Networks to Estimate 21st Century Greenland Ice Sheet Surface
Melt, Geophys. Res. Lett., 48, <a href="https://doi.org/10.1029/2021GL092449" target="_blank">https://doi.org/10.1029/2021GL092449</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Sun et al.(2024)Sun, Deng, Ren, Liu, Deng, and Jin</label><mixed-citation>
      
Sun, Y., Deng, K., Ren, K., Liu, J., Deng, C., and Jin, Y.: Deep learning in
statistical downscaling for deriving high spatial resolution gridded
meteorological data: A systematic review, ISPRS J. Photogramm., 208, 14–38, <a href="https://doi.org/10.1016/j.isprsjprs.2023.12.011" target="_blank">https://doi.org/10.1016/j.isprsjprs.2023.12.011</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Tebaldi et al.(2025)Tebaldi, Selin, Ferrari, and
Flierl</label><mixed-citation>
      
Tebaldi, C., Selin, N. E., Ferrari, R., and Flierl, G.: Emulators of climate
model output, Ann. Rev. Environ. Res., 50,
<a href="https://doi.org/10.1146/annurev-environ-012125-085838" target="_blank">https://doi.org/10.1146/annurev-environ-012125-085838</a>,  2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>The IMBIE Team(2020)</label><mixed-citation>
      
The IMBIE Team: Mass balance of the Greenland Ice Sheet from 1992 to
2018, Nature, 579, 233–239, <a href="https://doi.org/10.1038/s41586-019-1855-2" target="_blank">https://doi.org/10.1038/s41586-019-1855-2</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Theng and Bhoyar(2024)</label><mixed-citation>
      
Theng, D. and Bhoyar, K. K.: Feature selection techniques for machine learning:
a survey of more than two decades of research, Know. Inf.
Syst., 66, 1575–1637, <a href="https://doi.org/10.1007/s10115-023-02010-5" target="_blank">https://doi.org/10.1007/s10115-023-02010-5</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Tyralis et al.(2021)Tyralis, Papacharalampous, and
Langousis</label><mixed-citation>
      
Tyralis, H., Papacharalampous, G., and Langousis, A.: Super ensemble learning
for daily streamflow forecasting: Large-scale demonstration and comparison
with multiple machine learning algorithms, Neural Comput. Appl.,
33, 3053–3068, <a href="https://doi.org/10.1007/s00521-020-05172-3" target="_blank">https://doi.org/10.1007/s00521-020-05172-3</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>van der Meer et al.(2023)van der Meer, De Roda Husman, and
Lhermitte</label><mixed-citation>
      
van der Meer, M., De Roda Husman, S., and Lhermitte, S.: Deep Learning
Regional Climate Model Emulators: A Comparison of Two
Downscaling Training Frameworks, J. Adv. Model. Earth
Syst., 15, e2022MS003593, <a href="https://doi.org/10.1029/2022MS003593" target="_blank">https://doi.org/10.1029/2022MS003593</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Vandecrux et al.(2019)</label><mixed-citation>
      
Vandecrux, B., MacFerrin, M., Machguth, H., Colgan, W. T., van As, D., Heilig,
A., Stevens, C. M., Charalampidis, C., Fausto, R. S., Morris, E. M.,
Mosley-Thompson, E., Koenig, L., Montgomery, L. N., Miège, C., Simonsen,
S. B., Ingeman-Nielsen, T., and Box, J. E.: Firn data compilation reveals
widespread decrease of firn air content in western Greenland, The
Cryosphere, 13, 845–859, <a href="https://doi.org/10.5194/tc-13-845-2019" target="_blank">https://doi.org/10.5194/tc-13-845-2019</a>, 2019.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Vandecrux et al.(2024)</label><mixed-citation>
      
Vandecrux, B., Fausto, R. S., Box, J. E., Covi, F., Hock, R., Rennermalm,
Ã. K., Heilig, A., Abermann, J., van As, D., Bjerre, E., Fettweis, X.,
Smeets, P. C. J. P., Kuipers Munneke, P., van den Broeke, M. R., Brils, M.,
Langen, P. L., Mottram, R., and Ahlström, A. P.: Recent warming trends of
the Greenland ice sheet documented by historical firn and ice temperature
observations and machine learning, The Cryosphere, 18, 609–631,
<a href="https://doi.org/10.5194/tc-18-609-2024" target="_blank">https://doi.org/10.5194/tc-18-609-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Vaswani et al.(2017)Vaswani, Shazeer, Parmar, Uszkoreit, Jones,
Gomez, Kaiser, and Polosukhin</label><mixed-citation>
      
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N.,
Kaiser, Å., and Polosukhin, I.: Attention is all you need, Adv. Neur.
In., 30, <a href="https://proceedings.neurips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html" target="_blank"/> (last access: 28 May 2026), 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Veldhuijsen et al.(2025)Veldhuijsen, van de Berg, Kuipers Munneke,
Hansen, Boberg, Kittel, Amory, and van den
Broeke</label><mixed-citation>
      
Veldhuijsen, S. B. M., van de Berg, W. J., Kuipers Munneke, P., Hansen, N.,
Boberg, F., Kittel, C., Amory, C., and van den Broeke, M. R.: Emulating the
expansion of Antarctic perennial firn aquifers in the 21st century, The
Cryosphere, 19, 5157–5173, <a href="https://doi.org/10.5194/tc-19-5157-2025" target="_blank">https://doi.org/10.5194/tc-19-5157-2025</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Wang et al.(2021)Wang, Zender, van As, Fausto, and
Laffin</label><mixed-citation>
      
Wang, W., Zender, C. S., van As, D., Fausto, R. S., and Laffin, M. K.:
Greenland Surface Melt Dominated by Solar and Sensible Heating,
Geophys. Res. Lett., 48, e2020GL090653,
<a href="https://doi.org/10.1029/2020GL090653" target="_blank">https://doi.org/10.1029/2020GL090653</a>,  2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Webber(2013)</label><mixed-citation>
      
Webber, J. B. W.: A bi-symmetric log transformation for wide-range data,
Meas. Sci. Technol., 24, 027001,
<a href="https://doi.org/10.1088/0957-0233/24/2/027001" target="_blank">https://doi.org/10.1088/0957-0233/24/2/027001</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Wesselkamp et al.(2025)</label><mixed-citation>
      
Wesselkamp, M., Chantry, M., Pinnington, E., Choulga, M., Boussetta, S., Kalweit, M., Bödecker, J., Dormann, C. F., Pappenberger, F., and Balsamo, G.: Advances in land surface forecasting: a comparison of LSTM, gradient boosting, and feed-forward neural networks as prognostic state emulators in a case study with ecLand, Geosci. Model Dev., 18, 921–937, <a href="https://doi.org/10.5194/gmd-18-921-2025" target="_blank">https://doi.org/10.5194/gmd-18-921-2025</a>, 2025.

    </mixed-citation></ref-html>--></article>
