Accelerated decline of Svalbard coasts fast ice as a result of climate
change

Urbański, Jacek A.; Litwicka, Dagmara

doi:https://doi.org/10.5194/tc-2021-21

Preprints

https://doi.org/10.5194/tc-2021-21

Preprints

12 Mar 2021

| 12 Mar 2021

Status: this discussion paper is a preprint. It has been under review for the journal The Cryosphere (TC). The manuscript was not accepted for further review after discussion.

Accelerated decline of Svalbard coasts fast ice as a result of climate change

Jacek A. Urbański and Dagmara Litwicka

Abstract. In the Arctic, it is the Svalbard Archipelago that has experienced some of the most severe temperature increases in the last three decades. The temperature rise has accelerated de-icing along the archipelago's coasts, bringing changes to the local environment. As the fast ice distribution along Svalbard coasts before 2000 is mainly unknown, we use in situ observation data of the ice extent for the period of 2005–2018 to create a new geographic random forest model in order to predict daily ice extents using freezing and thawing degree days and time of ice season. This allows one to reconstruct the ice extent in the past and predict it in the near future from standard meteorological data with an accuracy of 0.95. The mean, at least two-month ice extent of fast sea ice along Svalbard coasts was about 12,000 km² between 1973 and 2000. In 2005–2018, however, the same ice extent declined to 8,000 km². Comparison of the periods 2005–2018 and 2014–2019 shows the accelerating decline of fast ice: the two-month fast ice extent is now only 6,000 km². A further increase in mean winter air temperatures by two degrees will result in a two-month fast ice extent of 2,000 km².

Received: 18 Jan 2021 – Discussion started: 12 Mar 2021

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Download & links

Jacek A. Urbański and Dagmara Litwicka

Status: closed

RC1: 'Comment on tc-2021-21', Anonymous Referee #1, 09 Apr 2021

Review of TC-2021-21 by J. UrbaÅski and D. Litwicka

Review:

Although this manuscript presents some promise as a demonstration of harnessing the power of Machine Learning (ML) for analysis of time series, it falls short in a number of key areas. These are detailed below in "Major issues". I also list many minor issues following this. I feel there is considerable work required to address the major comments, unfortunately.

Major issues:

1) The structure of the manuscript is OK up until L155, but quite poor from then onward. For example, there is no "Results" section. Results are, instead, scattered throughout the remainder of the manuscript, and even right up until the final paragraph of the conclusion. A revised manuscript would strongly benefit from containing all new results within a results section, then sticking to the tried-and-true Discussion and Conclusions following this. I find that the Elsevier 11 step guide to formatting a scientific paper helps here: https://www.elsevier.com/connect/11-steps-to-structuring-a-science-paper-editors-will-take-seriously

2) I have some serious reservations about the method underlying the conclusions in this paper. These are expanded upon in three parts:

a) In line 127 you say that for Random Forests, "extrapolation beyond the range of values in the training set is not possible" - however this is exactly what you do, both back in time (where you have shown the Freezing Degree Day (FDD) values are higher than in the training set), and forward in time (where FDDs are much lower). Ideally you'd want to split training/testing toward both the end and start of your validation time series to see if the model performs similarly at both ends - this might then give some confidence for extrapolation, but without this, it's unconvincing. In fact, I believe the discrepancy between observed and modeled fast ice in Fig 10 may be evidence of this poor performance when extrapolating.

b) You indicate that the default hyperparameters generally achieve good results, but i) there is no attempt to verify this in any way for this study, and ii) these default hyperparameters are not given anywhere in the manuscript. You don't even really state that you used the default hyperparameters. Also, without a sensitivity test for hyperparameter selection, you're essentially "breaking" the paradigm of using a training/cross-validation/testing split (because you aren't able to use the cross-validation properly). You also don't indicate how temporal and spatial autocorrelation are avoided in selecting the training dataset. From your description on Line 135, I half suspect you didn't even try to avoid this (e.g., neighbouring pixels are going to be highly correlated. Choosing one pixel for the training and the neighbouring pixel for the testing is obviously (and inappropriately) going to confer high skill to the model). In light of these suspicions, I suspect your very high claimed R^2 of 0.95 is a symptom of this, and that the "true" performance of the model, (e.g., when the validation data are not very highly correlated with the training data) is much lower.

3) The manuscipt suffers from a lack of clarity and an excess of unnecessary jargon - particularly in the Methods sections 3 and 4 (e.g., the definition of OOB which is neither used again nor even referred to). I also don't need to know details like the fact that your vector layers were zipped.

4) Figures are clear, but their presentation quality is not ideal. Many figures suffer from a lack of proper sentence case; lack of labels; lack of units; lack of sufficient caption; or strange red underlines of text suggestive of a screenshot of a word processor.

5) A major reference is completely missing from consideration: Yu et al., 2014, Journal of Climate (doi:10.1175/JCLI-D-13-00178.1). In this work, the authors (of which I am not one) digitise pan-Arctic fast ice charts back to 1976, not only including Svalbard, but also analysing the decrease in fast ice around Svalbard. Thus, your sentence in Line 55 "Unfortunately, this distribution in the last quarter of the 20th century is unknown" is not accurate. This might have considerable consequences for the justifaction of your work. However, I still believe there is value in your technique. But I don't think your work, which essentially attempts to model fast ice extent in this area, can be published without some kind of comparison to the Yu dataset. To be clearer, I think your work should be re-formed around a comparison with the Yu dataset back in time - essentially treating the Yu dataset as truth (NB - as a general statement, in my experience, ice charting of fast ice can be, at times, quite far from the truth! But there's certainly nothing better without re-interpreting the satellite imagery yourself).

6) I'm unconvinced that your choice of 0 C is appropriate as a Freezing Degree Days threshold. I was under the opinion that 0 C should only be used for fresh ice. I checked the Lepparanta 1993 paper and, although it's not explicitly stated, close reading reveals that it's clear that a value of around -2 C should be used for sea ice. I realise the Polar Science Center mentions a value of 0 C on their website, but the intended ice salinity is ambiguous on that site and still think this value should only be used for studies of formation/melt of fresh ice.

7) Fig 9 has a couple of major problems: Panel a): Your color scale shows fast ice time difference, but doesn't take into account that sometimes this difference is negative (e.g., around the northeast of Edge Island) - i.e., negative and positive differences are represented in the same color - not sure this is appropriate. Panel b): The diagonal lines going to the Isfjord higher-resolution domain imply that this is an enlargement, but it's not the case, as can be seen by the clearly different colours. This implies that your result is strongly dependent on the resolution of your hexagonal grid. But why is this the case? There are only three inputs to your model: FDD, TDD and day number of the ice season. And only one output: the ice coverage, which is trained by the ice charts. Which of these 4 things is different? (This brings me to another point: How was a spatially-complete field of TDD and FDD generated? Is this what's attempted to be explained at L79? Linear interpolation between three observer stations? Even assuming the grid scale change results in a slightly different FDD and TDD across the two different grids, it should still be very similar). Anyway, I really can't understand why grid scale has such a huge effect on the result given the inputs are so simple.

8) Even after reading the relevant sections a couple of times, I can't quite figure out what exactly you asked your RF model to produce. My interpretation is that you fed your RF time series (daily? Or at the resolution of the ice charts?) of TDD, FDD and day of ice season, and asked it to produce time series of fast ice coverage, as trained by ice charts. Is that the case? Then you compared this time series with ice charts (in the validation set, many times). So you end up with two (daily?) time series of fast ice cover for each grid cell: One from the RF and one from the validation data (the ice charts). Are these both binary? Or is the RF-derived prediction a smoothly-varying value between 0 and 1? From these, as detailed in L135, you "evaluated the error using RMS". RMS probably isn't a good metric to use if both outputs are binary. If the RF output is smoothly varying it might be OK though. However, I have a major concern in that a majority of these values require absolutely no skill to predict (i.e., on ice season day 1, a blanket prediction of 0 fast ice is probably a good idea everywhere). Indeed, as the vast majority of the grid cells in your domain never have fast ice (as shown in Fig. 8), a blanket prediction of "0" for every cell and for every time step is probably not bad. Basically, I am suspicious that your RMS statistic is artificially lowered by the fact that almost all grid cells never have fast ice. Similarly, your R^2 metric may also be affected. Even for those cells with fast ice cover, there is a skew toward lower fast ice coverage, so this warrants a more appropriate consideration of errors, such as using precision/recall/F1 score instead of a simple RMS.

9) Your abstract tells a story of an accelerating decline of fast ice extent. However, you never show a fast ice time series in any of your figures. The closest we get to see is a reduction in FDD in Fig 7 - however this isn't a paper about a reduction in FDD! Given you asked your RF to recreate fast ice extent, I think we need to see a time series of the primary output. In fact, in Line 188 you draw conclusions about fast ice directly from this plot of FDD - but would be so much more believable if you plotted a time series of fast ice extent.

Minor comments:

8: I don't think ice charts count as in situ observations.

13: It's not clear why you compare a 14 y period with a 6 y period. These seem quite arbitrary.

14: Avoid "now" - especially since your most recent data are already 2 years old.

14: What time period does a further two degrees warming correspond to?

17: Using the 66 degrees 33 min N definition of the Arctic, this doesn't seem to be true (much more land area in Scandanavia).

17: I don't believe that the location of Svalbard is strongly influenced by any current.

27: Needs a reference.

31: Needs a reference.

34: Extraneous space in this line (and a few other places in the manuscript).

46: Citation style inconsistence with Pavlov ref. Also extraneous hyphen.

Fig 1 caption: Missin degrees and minutes.

62: Most of these datasets are not in situ.

65: Missing circle over capital A (sorry, I don't know the correct name) in Alesund. Elsewhere in manuscript too.

75: We don't need to know the details of the libraries used.

Fig 2 caption: No mention of the date of this chart.

121: Should be just "Breiman (2001)"

171-177: Much of this is repeated - and it also reads like a conclusion.

Fig 7 caption: I think "since" should be "prior to"

186: Repeat of earlier.

191: Significance never tested.

192: Using units of deg C.day feels so imappropriate here, despite being numerically correct. K.day would be preferable.

Fig 8: Why are the two timescales in the comparison so different? In general, throughout the manuscript, the timescales being considered are not well-justified.

Fig 8 caption: Wait a second. The first paragraph of the discussion says you don't use a geographically-weighted random forest - but here and in Fig 10 you say that you do.

221: Typo in Conclusion.

228: "a bit" is too colloquial. Also, why are these points not annotated on Fig. 10?

233: What years to +2 and +4 degrees correspond to?

241: "wits" typo.

242: "rice" typo. Also this whole paragraph is not appropriate in the conclusions section.

275: double comma.

277: "influence of"

314: Capital B for Bay needed.

Does the paper address relevant scientific questions within the scope of TC?

Yes.

Does the paper present novel concepts, ideas, tools, or data?

Yes.

Are substantial conclusions reached?

Yes.

Are the scientific methods and assumptions valid and clearly outlined?

No.

Are the results sufficient to support the interpretations and conclusions?

No - questionable results stemming from choice of FDD threshold and extrapolation of random forest technique. Questionable methodology in the choice to not use cross-validation, nor to test effect of hyperparameters. Questionable methodology for selection of training data (suspect data independence not guaranteed)

Is the description of experiments and calculations sufficiently complete and precise to allow their reproduction by fellow scientists (traceability of results)?

No - Random forest hyperparameters not given. Methods section readability is low due to focus on unimportant details, e.g., file formats.

Do the authors give proper credit to related work and clearly indicate their own new/original contribution?

Yes.

Does the title clearly reflect the contents of the paper?

Yes.

Does the abstract provide a concise and complete summary?

No.

Is the overall presentation well structured and clear?

No - structure is poor from line 155 onward (e.g., no "Results" section; results scattered throughout discussion and conclusion).

Is the language fluent and precise?

Not always - but not a major problem.

Are mathematical formulae, symbols, abbreviations, and units correctly defined and used?

Yes.

Should any parts of the paper (text, formulae, figures, tables) be clarified, reduced, combined, or eliminated?

Yes - extensive clarification and reduction of duplicate information required.

Are the number and quality of references appropriate?

No - Important reference missing completely (Yu et al., 2014. doi:10.1175/JCLI-D-13-00178.1).

Is the amount and quality of supplementary material appropriate?

N/A.

Citation: https://doi.org/10.5194/tc-2021-21-RC1
RC2:
'Comment on tc-2021-21', Angelika Renner, 03 May 2021
Review – UrbaÅski & Litwicka, Accelerated decline of Svalbard coasts fast ice as a result of climate change, The Cryosphere Discussions

The authors investigate the development of land-fast sea ice around Svalbard using a geographical random forest model. The model is built based on ice charts and freezing and thawing degree days. Results are then used to discuss fast ice extent in the period 1973-2019. The approach is interesting but the manuscript has major shortcomings both in the explanation of the method and the overall structure of the manuscript. I therefore cannot recommend publication in its current form.

Major concerns:

A proper results section is missing. Instead, the authors go straight from the description of the random forest model to a discussion, interspersing bits of results with interpretation. It is impossible for the reader to get an impression of the model performance as the main results – the fast ice extent in Isfjorden and around Svalbard over time – is not shown anywhere. The conclusion section contains more results and little in terms of actual conclusions. I strongly recommend restructuring the manuscript: firstly, a basic results section which shows the model results both regarding the spatial distribution of fast ice in the two model domains and the temporal development, and provides a clear overview and summary of model errors, is needed. Then, a comparison of the model results to already published records of fast ice extent would further strengthen trust in model performance before one tries to extend the time series of fast ice extent backwards in time. A proper discussion of strengths and limitations of the model would be useful.

As far as I can see, the only variables included to predict fast ice formation are (positive and negative) freezing degree days. To derive freezing degree days, the authors use temperature records from Hopen, Isfjorden and Kongsfjorden, i.e. in the central Barents Sea and on the west coast of Svalbard. As the spatial distribution of the model performance is not shown, I cannot help but find this highly problematic – conditions are vastly different along the western and the northern and eastern side of Svalbard due to the different hydrographic regimes (periodic Atlantic Water inflow preventing ice formation in the west vs predominantly Arctic Waters in the north and east). In line with the comment above, I suggest to improve the presentation of model results and errors, and to include a discussion regarding validity of the chosen predictor variables in different regions of the model.

Other comments:

Line 46: The authors refer to Pavlov et al., 2019 (formatting needs correcting), but shouldn’t this be Pavlova et al, 2019, from the volume? I.e.: Pavlova O., Gerland S., Hop H. (2019) Changes in Sea-Ice Extent and Thickness in Kongsfjorden, Svalbard (2003–2016). In: Hop H., Wiencke C. (eds) The Ecosystem of Kongsfjorden, Svalbard. Advances in Polar Ecology, vol 2. Springer, Cham. https://doi.org/10.1007/978-3-319-46425-1_4

Line 46: “There has been less fast ice off the northern coasts” – is this really what the authors mean? Or rather that there have been fewer studies on fast ice along the north coast? Fast ice is rather extensive in the north, and more so than in e.g. Kongsfjorden or Isfjorden due to a more Arctic climate.

For the introduction: there have been more recent publications about sea ice cover in Svalbard and in Isfjorden in particular which the authors should consider including:

Dahlke S, Hughes NE, Wagner PM, et al. The observed recent surface air temperature development across Svalbard and concurring footprints in local sea ice cover. Int J Climatol. 2020;40:5246–5265. https://doi.org/10.1002/joc.6517DAHLKE ET AL. 5265

Skogseth, L.L.A. Olivier, F. Nilsen, E. Falck, N. Fraser, V. Tverberg, A.B. Ledang, A. Vader, M.O. Jonassen, J. Søreide, F. Cottier, J. Berge, B.V. Ivanov, S. Falk-Petersen

Variability and decadal trends in the Isfjorden (Svalbard) ocean climate and circulation – an indicator for climate change in the European Arctic

Progr Oceanogr, 187 (2020), 10.1016/j.pocean.2020.102394

Section 2: Please clarify: Is the record from Barentsburg used for Isfjorden? And from Ny-Ålesund for Kongsfjorden?

Section 3: I don’t understand why the authors estimate missing TAVG from interpolated maximum and minimum temperatures – why not interpolating TAVG directly?

Are FDD and TDD estimated for Isfjorden only or for the other locations as well? And how is this then combined in the model for the entire Svalbard archipelago?

Please provide a proper reference for the choice of T_f = 0 degree C. While this is true for freshwater, is is not accurate for fjord waters, but given the wide range of possible salinities in fjords, it probably wouldn’t make much difference.

There is a lot of unnecessary detail in the last part of Section 3 which could be shortened.

Line 120: In Figure 5, it looks like ICESD is used as feature, not ICESN. Which one is correct?

Line 127-128: Here, it is stated that one of the model’s shortcomings is that extrapolation is not possible – however, isn’t that exactly what the authors are attempting? Please clarify, also regarding which range of values is meant (ice-cover vs no ice cover, or range in FDD/TDD, or time period?).

Line 134-135: Please provide an explanation what these results mean and how they are estimated.

Line 137: How is the limit for “satisfactory” set? And what about the Svalbard model?

Figure 4: I would find it more useful to see a time series of fast ice extent instead of what looks like a histogram. In general, if the main model result is a time series and/or distribution of fast ice extent, this should be shown somewhere (both for the Isfjorden and the Svalbard model).

Since the aim is to model fast ice, is there a check whether a hexagon with ice cover is connected to land through other hexagons with ice cover?

Line 160-161: There needs to be a discussion whether this holds true for the entire period, given the known influence of Atlantic Water inflow periods on ice cover in western Svalbard fjords (e.g. Pavlov et al., 2013, Tverberg et al., 2019, Nilsen et al., 2008, Skogseth et al., 2020)

Figure 6: Is there a spatial trend in these, given that the input data for FDD are all in the west or south of Svalbard?

Line 177-178: Gerland et al., 2008 show data for Hopen, not Isfjorden. It would be very helpful to actually see the comparison somewhere!

Figure 7: How did you determine 2005 as the start of the warmer winters?

Figure 8: Without having seen a direct comparison between model and observations for the same period, I find this figure little useful. It is impossible to assess whether the differences between a) and b) are truly because of changes over time, or whether they are influenced by model artifacts or errors. It would make much more sense to show the model results for both periods, or give a proper model-observation comparison first.

Line 203: I guess it’s debatable whether some of the features along the east coast could be considered fjords… However, there is considerable fast ice in Storfjorden.

Section 5 & 6: I’m confused by the different periods considered: How are they chosen? And why is there suddenly in the Conclusion the period 2014-2019? Please provide a justification for the different periods and then use them consistently throughout results, discussion and conclusion.
Citation: https://doi.org/10.5194/tc-2021-21-RC2

Status: closed

RC1: 'Comment on tc-2021-21', Anonymous Referee #1, 09 Apr 2021

Review of TC-2021-21 by J. UrbaÅski and D. Litwicka

Review:

Although this manuscript presents some promise as a demonstration of harnessing the power of Machine Learning (ML) for analysis of time series, it falls short in a number of key areas. These are detailed below in "Major issues". I also list many minor issues following this. I feel there is considerable work required to address the major comments, unfortunately.

Major issues:

1) The structure of the manuscript is OK up until L155, but quite poor from then onward. For example, there is no "Results" section. Results are, instead, scattered throughout the remainder of the manuscript, and even right up until the final paragraph of the conclusion. A revised manuscript would strongly benefit from containing all new results within a results section, then sticking to the tried-and-true Discussion and Conclusions following this. I find that the Elsevier 11 step guide to formatting a scientific paper helps here: https://www.elsevier.com/connect/11-steps-to-structuring-a-science-paper-editors-will-take-seriously

2) I have some serious reservations about the method underlying the conclusions in this paper. These are expanded upon in three parts:

a) In line 127 you say that for Random Forests, "extrapolation beyond the range of values in the training set is not possible" - however this is exactly what you do, both back in time (where you have shown the Freezing Degree Day (FDD) values are higher than in the training set), and forward in time (where FDDs are much lower). Ideally you'd want to split training/testing toward both the end and start of your validation time series to see if the model performs similarly at both ends - this might then give some confidence for extrapolation, but without this, it's unconvincing. In fact, I believe the discrepancy between observed and modeled fast ice in Fig 10 may be evidence of this poor performance when extrapolating.

b) You indicate that the default hyperparameters generally achieve good results, but i) there is no attempt to verify this in any way for this study, and ii) these default hyperparameters are not given anywhere in the manuscript. You don't even really state that you used the default hyperparameters. Also, without a sensitivity test for hyperparameter selection, you're essentially "breaking" the paradigm of using a training/cross-validation/testing split (because you aren't able to use the cross-validation properly). You also don't indicate how temporal and spatial autocorrelation are avoided in selecting the training dataset. From your description on Line 135, I half suspect you didn't even try to avoid this (e.g., neighbouring pixels are going to be highly correlated. Choosing one pixel for the training and the neighbouring pixel for the testing is obviously (and inappropriately) going to confer high skill to the model). In light of these suspicions, I suspect your very high claimed R^2 of 0.95 is a symptom of this, and that the "true" performance of the model, (e.g., when the validation data are not very highly correlated with the training data) is much lower.

3) The manuscipt suffers from a lack of clarity and an excess of unnecessary jargon - particularly in the Methods sections 3 and 4 (e.g., the definition of OOB which is neither used again nor even referred to). I also don't need to know details like the fact that your vector layers were zipped.

4) Figures are clear, but their presentation quality is not ideal. Many figures suffer from a lack of proper sentence case; lack of labels; lack of units; lack of sufficient caption; or strange red underlines of text suggestive of a screenshot of a word processor.

5) A major reference is completely missing from consideration: Yu et al., 2014, Journal of Climate (doi:10.1175/JCLI-D-13-00178.1). In this work, the authors (of which I am not one) digitise pan-Arctic fast ice charts back to 1976, not only including Svalbard, but also analysing the decrease in fast ice around Svalbard. Thus, your sentence in Line 55 "Unfortunately, this distribution in the last quarter of the 20th century is unknown" is not accurate. This might have considerable consequences for the justifaction of your work. However, I still believe there is value in your technique. But I don't think your work, which essentially attempts to model fast ice extent in this area, can be published without some kind of comparison to the Yu dataset. To be clearer, I think your work should be re-formed around a comparison with the Yu dataset back in time - essentially treating the Yu dataset as truth (NB - as a general statement, in my experience, ice charting of fast ice can be, at times, quite far from the truth! But there's certainly nothing better without re-interpreting the satellite imagery yourself).

6) I'm unconvinced that your choice of 0 C is appropriate as a Freezing Degree Days threshold. I was under the opinion that 0 C should only be used for fresh ice. I checked the Lepparanta 1993 paper and, although it's not explicitly stated, close reading reveals that it's clear that a value of around -2 C should be used for sea ice. I realise the Polar Science Center mentions a value of 0 C on their website, but the intended ice salinity is ambiguous on that site and still think this value should only be used for studies of formation/melt of fresh ice.

7) Fig 9 has a couple of major problems: Panel a): Your color scale shows fast ice time difference, but doesn't take into account that sometimes this difference is negative (e.g., around the northeast of Edge Island) - i.e., negative and positive differences are represented in the same color - not sure this is appropriate. Panel b): The diagonal lines going to the Isfjord higher-resolution domain imply that this is an enlargement, but it's not the case, as can be seen by the clearly different colours. This implies that your result is strongly dependent on the resolution of your hexagonal grid. But why is this the case? There are only three inputs to your model: FDD, TDD and day number of the ice season. And only one output: the ice coverage, which is trained by the ice charts. Which of these 4 things is different? (This brings me to another point: How was a spatially-complete field of TDD and FDD generated? Is this what's attempted to be explained at L79? Linear interpolation between three observer stations? Even assuming the grid scale change results in a slightly different FDD and TDD across the two different grids, it should still be very similar). Anyway, I really can't understand why grid scale has such a huge effect on the result given the inputs are so simple.

8) Even after reading the relevant sections a couple of times, I can't quite figure out what exactly you asked your RF model to produce. My interpretation is that you fed your RF time series (daily? Or at the resolution of the ice charts?) of TDD, FDD and day of ice season, and asked it to produce time series of fast ice coverage, as trained by ice charts. Is that the case? Then you compared this time series with ice charts (in the validation set, many times). So you end up with two (daily?) time series of fast ice cover for each grid cell: One from the RF and one from the validation data (the ice charts). Are these both binary? Or is the RF-derived prediction a smoothly-varying value between 0 and 1? From these, as detailed in L135, you "evaluated the error using RMS". RMS probably isn't a good metric to use if both outputs are binary. If the RF output is smoothly varying it might be OK though. However, I have a major concern in that a majority of these values require absolutely no skill to predict (i.e., on ice season day 1, a blanket prediction of 0 fast ice is probably a good idea everywhere). Indeed, as the vast majority of the grid cells in your domain never have fast ice (as shown in Fig. 8), a blanket prediction of "0" for every cell and for every time step is probably not bad. Basically, I am suspicious that your RMS statistic is artificially lowered by the fact that almost all grid cells never have fast ice. Similarly, your R^2 metric may also be affected. Even for those cells with fast ice cover, there is a skew toward lower fast ice coverage, so this warrants a more appropriate consideration of errors, such as using precision/recall/F1 score instead of a simple RMS.

9) Your abstract tells a story of an accelerating decline of fast ice extent. However, you never show a fast ice time series in any of your figures. The closest we get to see is a reduction in FDD in Fig 7 - however this isn't a paper about a reduction in FDD! Given you asked your RF to recreate fast ice extent, I think we need to see a time series of the primary output. In fact, in Line 188 you draw conclusions about fast ice directly from this plot of FDD - but would be so much more believable if you plotted a time series of fast ice extent.

Minor comments:

8: I don't think ice charts count as in situ observations.

13: It's not clear why you compare a 14 y period with a 6 y period. These seem quite arbitrary.

14: Avoid "now" - especially since your most recent data are already 2 years old.

14: What time period does a further two degrees warming correspond to?

17: Using the 66 degrees 33 min N definition of the Arctic, this doesn't seem to be true (much more land area in Scandanavia).

17: I don't believe that the location of Svalbard is strongly influenced by any current.

27: Needs a reference.

31: Needs a reference.

34: Extraneous space in this line (and a few other places in the manuscript).

46: Citation style inconsistence with Pavlov ref. Also extraneous hyphen.

Fig 1 caption: Missin degrees and minutes.

62: Most of these datasets are not in situ.

65: Missing circle over capital A (sorry, I don't know the correct name) in Alesund. Elsewhere in manuscript too.

75: We don't need to know the details of the libraries used.

Fig 2 caption: No mention of the date of this chart.

121: Should be just "Breiman (2001)"

171-177: Much of this is repeated - and it also reads like a conclusion.

Fig 7 caption: I think "since" should be "prior to"

186: Repeat of earlier.

191: Significance never tested.

192: Using units of deg C.day feels so imappropriate here, despite being numerically correct. K.day would be preferable.

Fig 8: Why are the two timescales in the comparison so different? In general, throughout the manuscript, the timescales being considered are not well-justified.

Fig 8 caption: Wait a second. The first paragraph of the discussion says you don't use a geographically-weighted random forest - but here and in Fig 10 you say that you do.

221: Typo in Conclusion.

228: "a bit" is too colloquial. Also, why are these points not annotated on Fig. 10?

233: What years to +2 and +4 degrees correspond to?

241: "wits" typo.

242: "rice" typo. Also this whole paragraph is not appropriate in the conclusions section.

275: double comma.

277: "influence of"

314: Capital B for Bay needed.

Does the paper address relevant scientific questions within the scope of TC?

Yes.

Does the paper present novel concepts, ideas, tools, or data?

Yes.

Are substantial conclusions reached?

Yes.

Are the scientific methods and assumptions valid and clearly outlined?

No.

Are the results sufficient to support the interpretations and conclusions?

No - questionable results stemming from choice of FDD threshold and extrapolation of random forest technique. Questionable methodology in the choice to not use cross-validation, nor to test effect of hyperparameters. Questionable methodology for selection of training data (suspect data independence not guaranteed)

Is the description of experiments and calculations sufficiently complete and precise to allow their reproduction by fellow scientists (traceability of results)?

No - Random forest hyperparameters not given. Methods section readability is low due to focus on unimportant details, e.g., file formats.

Do the authors give proper credit to related work and clearly indicate their own new/original contribution?

Yes.

Does the title clearly reflect the contents of the paper?

Yes.

Does the abstract provide a concise and complete summary?

No.

Is the overall presentation well structured and clear?

No - structure is poor from line 155 onward (e.g., no "Results" section; results scattered throughout discussion and conclusion).

Is the language fluent and precise?

Not always - but not a major problem.

Are mathematical formulae, symbols, abbreviations, and units correctly defined and used?

Yes.

Should any parts of the paper (text, formulae, figures, tables) be clarified, reduced, combined, or eliminated?

Yes - extensive clarification and reduction of duplicate information required.

Are the number and quality of references appropriate?

No - Important reference missing completely (Yu et al., 2014. doi:10.1175/JCLI-D-13-00178.1).

Is the amount and quality of supplementary material appropriate?

N/A.

Citation: https://doi.org/10.5194/tc-2021-21-RC1
RC2:
'Comment on tc-2021-21', Angelika Renner, 03 May 2021
Review – UrbaÅski & Litwicka, Accelerated decline of Svalbard coasts fast ice as a result of climate change, The Cryosphere Discussions

The authors investigate the development of land-fast sea ice around Svalbard using a geographical random forest model. The model is built based on ice charts and freezing and thawing degree days. Results are then used to discuss fast ice extent in the period 1973-2019. The approach is interesting but the manuscript has major shortcomings both in the explanation of the method and the overall structure of the manuscript. I therefore cannot recommend publication in its current form.

Major concerns:

A proper results section is missing. Instead, the authors go straight from the description of the random forest model to a discussion, interspersing bits of results with interpretation. It is impossible for the reader to get an impression of the model performance as the main results – the fast ice extent in Isfjorden and around Svalbard over time – is not shown anywhere. The conclusion section contains more results and little in terms of actual conclusions. I strongly recommend restructuring the manuscript: firstly, a basic results section which shows the model results both regarding the spatial distribution of fast ice in the two model domains and the temporal development, and provides a clear overview and summary of model errors, is needed. Then, a comparison of the model results to already published records of fast ice extent would further strengthen trust in model performance before one tries to extend the time series of fast ice extent backwards in time. A proper discussion of strengths and limitations of the model would be useful.

As far as I can see, the only variables included to predict fast ice formation are (positive and negative) freezing degree days. To derive freezing degree days, the authors use temperature records from Hopen, Isfjorden and Kongsfjorden, i.e. in the central Barents Sea and on the west coast of Svalbard. As the spatial distribution of the model performance is not shown, I cannot help but find this highly problematic – conditions are vastly different along the western and the northern and eastern side of Svalbard due to the different hydrographic regimes (periodic Atlantic Water inflow preventing ice formation in the west vs predominantly Arctic Waters in the north and east). In line with the comment above, I suggest to improve the presentation of model results and errors, and to include a discussion regarding validity of the chosen predictor variables in different regions of the model.

Other comments:

Line 46: The authors refer to Pavlov et al., 2019 (formatting needs correcting), but shouldn’t this be Pavlova et al, 2019, from the volume? I.e.: Pavlova O., Gerland S., Hop H. (2019) Changes in Sea-Ice Extent and Thickness in Kongsfjorden, Svalbard (2003–2016). In: Hop H., Wiencke C. (eds) The Ecosystem of Kongsfjorden, Svalbard. Advances in Polar Ecology, vol 2. Springer, Cham. https://doi.org/10.1007/978-3-319-46425-1_4

Line 46: “There has been less fast ice off the northern coasts” – is this really what the authors mean? Or rather that there have been fewer studies on fast ice along the north coast? Fast ice is rather extensive in the north, and more so than in e.g. Kongsfjorden or Isfjorden due to a more Arctic climate.

For the introduction: there have been more recent publications about sea ice cover in Svalbard and in Isfjorden in particular which the authors should consider including:

Dahlke S, Hughes NE, Wagner PM, et al. The observed recent surface air temperature development across Svalbard and concurring footprints in local sea ice cover. Int J Climatol. 2020;40:5246–5265. https://doi.org/10.1002/joc.6517DAHLKE ET AL. 5265

Skogseth, L.L.A. Olivier, F. Nilsen, E. Falck, N. Fraser, V. Tverberg, A.B. Ledang, A. Vader, M.O. Jonassen, J. Søreide, F. Cottier, J. Berge, B.V. Ivanov, S. Falk-Petersen

Variability and decadal trends in the Isfjorden (Svalbard) ocean climate and circulation – an indicator for climate change in the European Arctic

Progr Oceanogr, 187 (2020), 10.1016/j.pocean.2020.102394

Section 2: Please clarify: Is the record from Barentsburg used for Isfjorden? And from Ny-Ålesund for Kongsfjorden?

Section 3: I don’t understand why the authors estimate missing TAVG from interpolated maximum and minimum temperatures – why not interpolating TAVG directly?

Are FDD and TDD estimated for Isfjorden only or for the other locations as well? And how is this then combined in the model for the entire Svalbard archipelago?

Please provide a proper reference for the choice of T_f = 0 degree C. While this is true for freshwater, is is not accurate for fjord waters, but given the wide range of possible salinities in fjords, it probably wouldn’t make much difference.

There is a lot of unnecessary detail in the last part of Section 3 which could be shortened.

Line 120: In Figure 5, it looks like ICESD is used as feature, not ICESN. Which one is correct?

Line 127-128: Here, it is stated that one of the model’s shortcomings is that extrapolation is not possible – however, isn’t that exactly what the authors are attempting? Please clarify, also regarding which range of values is meant (ice-cover vs no ice cover, or range in FDD/TDD, or time period?).

Line 134-135: Please provide an explanation what these results mean and how they are estimated.

Line 137: How is the limit for “satisfactory” set? And what about the Svalbard model?

Figure 4: I would find it more useful to see a time series of fast ice extent instead of what looks like a histogram. In general, if the main model result is a time series and/or distribution of fast ice extent, this should be shown somewhere (both for the Isfjorden and the Svalbard model).

Since the aim is to model fast ice, is there a check whether a hexagon with ice cover is connected to land through other hexagons with ice cover?

Line 160-161: There needs to be a discussion whether this holds true for the entire period, given the known influence of Atlantic Water inflow periods on ice cover in western Svalbard fjords (e.g. Pavlov et al., 2013, Tverberg et al., 2019, Nilsen et al., 2008, Skogseth et al., 2020)

Figure 6: Is there a spatial trend in these, given that the input data for FDD are all in the west or south of Svalbard?

Line 177-178: Gerland et al., 2008 show data for Hopen, not Isfjorden. It would be very helpful to actually see the comparison somewhere!

Figure 7: How did you determine 2005 as the start of the warmer winters?

Figure 8: Without having seen a direct comparison between model and observations for the same period, I find this figure little useful. It is impossible to assess whether the differences between a) and b) are truly because of changes over time, or whether they are influenced by model artifacts or errors. It would make much more sense to show the model results for both periods, or give a proper model-observation comparison first.

Line 203: I guess it’s debatable whether some of the features along the east coast could be considered fjords… However, there is considerable fast ice in Storfjorden.

Section 5 & 6: I’m confused by the different periods considered: How are they chosen? And why is there suddenly in the Conclusion the period 2014-2019? Please provide a justification for the different periods and then use them consistently throughout results, discussion and conclusion.
Citation: https://doi.org/10.5194/tc-2021-21-RC2

Jacek A. Urbański and Dagmara Litwicka

Data sets

Supplement Data Jacek A. Urbański and Dagmara Litwicka https://doi.org/10.5281/zenodo.4597283

Jacek A. Urbański and Dagmara Litwicka

Viewed

Total article views: 1,651 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
1,120	455	76	1,651	70	88

HTML: 1,120
PDF: 455
XML: 76
Total: 1,651
BibTeX: 70
EndNote: 88

Views and downloads (calculated since 12 Mar 2021)

Month	HTML	PDF	XML	Total
Mar 2021	179	54	1	234
Apr 2021	47	14	3	64
May 2021	65	11	2	78
Jun 2021	28	3	0	31
Jul 2021	18	2	0	20
Aug 2021	25	4	1	30
Sep 2021	27	10	0	37
Oct 2021	31	50	0	81
Nov 2021	21	28	0	49
Dec 2021	20	3	0	23
Jan 2022	43	7	1	51
Feb 2022	29	8	4	41
Mar 2022	27	11	3	41
Apr 2022	22	7	0	29
May 2022	8	7	1	16
Jun 2022	7	1	1	9
Jul 2022	7	1	0	8
Aug 2022	8	6	0	14
Sep 2022	9	11	1	21
Oct 2022	11	5	1	17
Nov 2022	9	6	1	16
Dec 2022	13	6	0	19
Jan 2023	8	8	1	17
Feb 2023	14	2	0	16
Mar 2023	11	3	1	15
Apr 2023	5	5	1	11
May 2023	8	3	2	13
Jun 2023	12	6	1	19
Jul 2023	7	6	3	16
Aug 2023	5	10	1	16
Sep 2023	12	4	2	18
Oct 2023	13	10	1	24
Nov 2023	18	4	4	26
Dec 2023	11	6	0	17
Jan 2024	17	3	0	20
Feb 2024	27	15	4	46
Mar 2024	33	8	3	44
Apr 2024	15	4	4	23
May 2024	17	6	3	26
Jun 2024	6	5	1	12
Jul 2024	6	7	3	16
Aug 2024	24	3	1	28
Sep 2024	19	5	0	24
Oct 2024	11	10	0	21
Nov 2024	7	6	0	13
Dec 2024	15	11	7	33
Jan 2025	26	8	2	36
Feb 2025	56	8	1	65
Mar 2025	15	8	4	27
Apr 2025	9	7	0	16
May 2025	11	2	2	15
Jun 2025	15	12	2	29
Jul 2025	13	5	2	20

Cumulative views and downloads (calculated since 12 Mar 2021)

Month	HTML	PDF	XML	Total
Mar 2021	179	54	1	234
Apr 2021	47	14	3	64
May 2021	65	11	2	78
Jun 2021	28	3	0	31
Jul 2021	18	2	0	20
Aug 2021	25	4	1	30
Sep 2021	27	10	0	37
Oct 2021	31	50	0	81
Nov 2021	21	28	0	49
Dec 2021	20	3	0	23
Jan 2022	43	7	1	51
Feb 2022	29	8	4	41
Mar 2022	27	11	3	41
Apr 2022	22	7	0	29
May 2022	8	7	1	16
Jun 2022	7	1	1	9
Jul 2022	7	1	0	8
Aug 2022	8	6	0	14
Sep 2022	9	11	1	21
Oct 2022	11	5	1	17
Nov 2022	9	6	1	16
Dec 2022	13	6	0	19
Jan 2023	8	8	1	17
Feb 2023	14	2	0	16
Mar 2023	11	3	1	15
Apr 2023	5	5	1	11
May 2023	8	3	2	13
Jun 2023	12	6	1	19
Jul 2023	7	6	3	16
Aug 2023	5	10	1	16
Sep 2023	12	4	2	18
Oct 2023	13	10	1	24
Nov 2023	18	4	4	26
Dec 2023	11	6	0	17
Jan 2024	17	3	0	20
Feb 2024	27	15	4	46
Mar 2024	33	8	3	44
Apr 2024	15	4	4	23
May 2024	17	6	3	26
Jun 2024	6	5	1	12
Jul 2024	6	7	3	16
Aug 2024	24	3	1	28
Sep 2024	19	5	0	24
Oct 2024	11	10	0	21
Nov 2024	7	6	0	13
Dec 2024	15	11	7	33
Jan 2025	26	8	2	36
Feb 2025	56	8	1	65
Mar 2025	15	8	4	27
Apr 2025	9	7	0	16
May 2025	11	2	2	15
Jun 2025	15	12	2	29
Jul 2025	13	5	2	20

Viewed (geographical distribution)

Total article views: 1,624 (including HTML, PDF, and XML) Thereof 1,624 with geography defined and 0 with unknown origin.

Country	#	Views	%

Cited

Latest update: 18 Jul 2025

Short summary

The primary aim of the presented research was to characterize the spatial distribution of the mean temporal difference in the presence of fast ice between 1975-2000 and 2014-2019 at the archipelago scale and the fjord scale of Svalbard. The second aim was to quantify the changes in the fast ice surface area in different time periods, and in the near future, assuming the forecast increase in temperature.


Total:	0
HTML:	0
PDF:	0
XML:	0

Accelerated decline of Svalbard coasts fast ice as a result of climate change

Data sets

Viewed

Viewed (geographical distribution)

Cited

4 citations as recorded by crossref.