1. Introduction
Snow is a viable proxy through which one may study climate change, as the associated modifications to precipitation and temperature patterns are believed to be strongest during the cold season in the mid and high latitudes where snowfall is prominent [1,2,3,4,5]. The rate at which climate is changing in polar regions is thought to exceed the rate at which natural systems can adapt [6]. Global climate models indicate that snow cover changes will considerably impact the cryospheric portion of the water budget [7,8,9]. Snow is a vital environmental and geophysical quantity and is sensitive to climate change since its magnitude and extent depend on both temperature and precipitation [10,11,12].
Snow depth is the measured (or estimated) depth of a snow pack at a location, and takes into account the accumulation, ablation, and evolution of a snow pack. Therefore, snow depth should not be confused with snow cover (presence/absence) or new snowfall. Satellite data over the Northern Hemisphere suggest that snow cover has lessened since the mid-1980s [6,13,14,15,16,17,18]. Snow depth analyses complement snow cover change studies, providing further information on hydrological resources, surface energy, soil processes, and ecological systems [12,19]. Trend estimates in snow depths [12,13,20,21] and new snowfall [22,23,24,25] over various portions of the United States and Canada have been previously computed and related to climate variability [8,18,26,27,28].
When assessing long-term trends in snow depths, one should consider the temporal homogeneity of the data ([21,23,29,30]). Snow depth series often have discontinuities induced by changes in measuring location, equipment, or methods. These discontinuities—the so-called changepoints (breakpoints)—are crucial in constructing a realistic trend estimate at any one location ([21,24,30]). We do not attempt to attribute causes to any identified changepoints—they could be due to climate shifts, measuring changes, station moves, etc.; see [29] for additional discussion.
Perhaps the paramount contribution here lies in the methods used to handle changepoint effects. Our methods apply equally well to snowfall and snow cover trend estimation. In this work, an unknown number of changepoints are modeled as mean level shifts occurring at unknown times in each gridded snow depth series.
Ref. [30] lists four difficulties in the estimation of daily snow depth trends:
1. Snow is seasonal, being mostly absent during the summer months at all but Arctic or high alpine locations.
2. Daily snow depths are highly correlated in time: tomorrow’s snow depth depends on today’s snow depth. Statistical inferences that ignore this correlation will yield artificially high levels of confidence.
3. Snow depths cannot be negative; this “zero modified support set issue”, or hard boundary, must be addressed.
4. One should allow for changepoints in trend analyses; as we will see, they are of critical importance.
This is the first study to address all four points above in a regional assessment of snow depth trends.
Previous snow trend studies have used various methods to obtain trends. Some studies use trend estimates obtained by minimizing a sum of squared deviations ([12,20]); however, homogenization issues are not considered there. Ref. [24] quality control their data for inhomogeneities via expert opinion; however, their methods involve a target minus reference series approach, which is difficult to use here since our gridded series may aggregate many series over each grid cell. Other studies ([8,18,28,29]) use the methods in [31], where a non-parametric regression analysis is performed on yearly maximal new snowfall amounts using Kendall’s tau statistic ([32]). While Kendall’s tau statistic can be made to accommodate the serial correlation in snow depth data ([31]), it is non-trivial to do so; moreover, these methods cannot handle changepoint features. Ref. [7] address changepoints by merging distinct station records via the standard normal homogeneity test (SNHT) in [33]. The SNHT is a single changepoint test and may not describe multiple changepoint scenarios well ([34]).
Ref. [29] note that the data used here contain many undocumented changepoints. In general, the snow depth data are replete with changepoints; indeed, the Napoleon series studied in [30] contained 18 documented breakpoints from 1 January 1900 to 31 December 2003.
Estimation of multiple changepoint times via genetic algorithms (GAs) has become increasingly popular in climate homogenization pursuits ([35,36]). Here, a GA is applied to the snow depth series at each grid to estimate changepoint times. These changepoints are subsequently used in a stochastic storage model. The storage model approach used here was introduced in [30] and stems from queueing theory. Snow is a natural storage phenomenon: the snow depth today is the snow depth yesterday, plus any new snow that has fallen, minus any ablation or compaction. Ref. [37] also use storage models to describe daily snow depths, but do not allow for trends, melting in the fall and winter, or snow increases during spring ablation. In addition to trend estimates, computation of trend standard errors is also considered.
Results are demonstrated by applying our methods to a snow depth data set previously constructed and analyzed in [12,28,29]. This article does not focus on how to assess the suitability of one data set over another, but rather on how to obtain an accurate trend estimate from given data. The papers perhaps most relevant to this study are [38], which develops additional statistical aspects of the methods, and [39], which also studies trends in snow depth time series with periodic aspects considered.
The rest of the study proceeds as follows. A description of the data and study period is presented in Section 2. Section 3 describes our changepoint estimation techniques. Section 4 discusses the stochastic storage model used to compute trend standard errors. Section 5 presents results and concluding remarks.
2. The Data
The development of the gridded data set used in this study is now summarized. The creation of daily snow depth grids from station level data from 1900–2000 is discussed by Dyer and Mote (2006), although only data from 1960–2000 were analyzed there. Kluver et al. (2016) update the creation and validation procedures introduced in Dyer and Mote (2006) and extend the time record to 1900–2009. A brief review of the data is now given; the interested reader is referred to [12,22,29] for additional detail.
The grids considered in the data examined here extend from N to N latitude and from W to W longitude. Station data are interpolated to grids via the Spheremap spatial interpolation program of [40]. Ref. [29] report station density and found that the number of stations in a grid cell reporting snow depths varies widely with time (see Figure 1 of [29]), with a large increase in the number of reporting stations beginning in 1948 due to the full digitization of Cooperative Observer Program (COOP) station network data. A depiction of station density by grid is given in Figure 3 of [29].
Ref. [29] caution against using pre-1948 data in estimating trends, since station density is sparse before this time. As such, our study will commence in the summer of 1947 and finish in the summer of 2009 to permit work with “winter centered years” as defined below. Observations during the latter half of 2009 are omitted to accommodate the methods of Section 3 and Section 4.
Although [12] study snow trends over the entirety of North America from 1960–2000, a period in which stations are claimed (without numerical quantification) to be more evenly dispersed, we restrict this study to the largest contiguous grid set reporting at least 20 winter seasons of station observations, subject to the following constraint: the northernmost grid in the study region, at each longitude, is set to be the northernmost cell containing at least 20 winter seasons with one or more reporting stations during 1948–2009. Concessions are made to make this region contiguous; this essentially excludes only a few isolated Arctic grids with good data. Our study region will be evident in our graphics.
Ref. [29] admit that the data contain various undocumented inhomogeneities in space and time due to differences in station recording procedures, reaffirming the importance of changepoints. After the changepoint times are estimated, they are used in a stochastic storage model to help gauge uncertainty in the estimated trends. The storage model easily accommodates changepoint features.
3. Changepoint Methods
This section discusses our multiple changepoint detection methods. This is tantamount to data homogenization. The methods are illustrated with a snow depth series from a cell centered at latitude 44.5 N, longitude 115.5 W, located near Warm Lake in the Boise National Forest in Central Idaho.
Genetic algorithms (GAs) have been successfully used in recent climate homogenization studies ([34,35]). GAs use principles of genetic selection and mutation to intelligently search for the best possible (as defined below) changepoint configuration. Although trends in daily depths are our focus, changepoints in yearly average snow depth series are first estimated. Homogenization of daily series is significantly more involved because of the long series lengths encountered ([36]). Any changepoint detected in the yearly series is assigned to the first day of the corresponding winter centered year for that grid.
Let $X_t$ denote the snow depth series at a fixed grid for days $t = 1, \ldots, N$. Winter centered years (WCYs) are analyzed here. The starting observation for any WCY is 1 July; 30 June of the subsequent calendar year is the last day. WCYs prevent a single winter season from straddling two distinct years. To have 1948 commence our study, $t = 1$ will correspond to 1 July 1947; $t = N$ corresponds to 30 June 2009, the study end date. Leap year data are handled by deleting the 30 June observation within the calendar year on which 29 February occurs—this has virtually no impact on results. There are $d = 62$ WCYs in our study. For a periodic notation with time, we write time $t$ as $t = (n-1)T + \nu$, where $n \in \{1, \ldots, d\}$ is the WCY, $\nu \in \{1, \ldots, T\}$ represents the day within the WCY, and $T = 365$.
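To make the WCY bookkeeping concrete, the following minimal Python sketch (with a hypothetical `depth_by_date` input; the paper's own processing code is not reproduced here) arranges daily depths into 365-day WCYs and applies the leap-year deletion rule just described.

```python
from datetime import date, timedelta
import calendar

def winter_centered_series(depth_by_date,
                           start=date(1947, 7, 1), end=date(2009, 6, 30)):
    """Arrange daily depths into 365-day winter centered years (WCYs).

    A WCY runs 1 July through 30 June. The 30 June observation is
    dropped in any calendar year containing a 29 February, per the
    leap-year rule above, so every WCY has exactly 365 days.
    `depth_by_date` is assumed to map datetime.date -> depth; missing
    days come back as None.
    """
    series, day = [], start
    while day <= end:
        is_dropped = (calendar.isleap(day.year)
                      and day.month == 6 and day.day == 30)
        if not is_dropped:
            series.append(depth_by_date.get(day))
        day += timedelta(days=1)
    return series  # 62 WCYs x 365 days = 22,630 values
```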
At a given grid, define $\bar{X}_n = |S|^{-1} \sum_{\nu \in S} X_{(n-1)T + \nu}$ as the average winter snow depth for WCY $n$, where $|A|$ is the number of elements in the set $A$ and $S$ denotes the snow season for this grid. The snow season is considered to start on the first day in the Fall/Winter on which snow is present in at least 35 percent of the WCY years in the study; the last day of the snow season is the latest day in the Winter/Spring on which at least 35 percent of the WCY years report snow. The 35 percent proportion is somewhat arbitrary, but is chosen as it is high enough for accurate inferences to be made based on the minimum number of days with snow cover without being overly selective. For the grid centered near Warm Lake, Idaho, the snow season spans 25 October to 21 May inclusive, a length of 218 days.
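The 35 percent rule is simple to implement; a minimal sketch, assuming a years-by-days depth array with missing days treated as snow-free, is:

```python
import numpy as np

def snow_season(depths, threshold=0.35):
    """Locate the snow season for one grid cell.

    `depths` is a (years, 365) array of daily depths indexed by WCY day
    (day 0 = 1 July). The season runs from the first day on which snow
    is present in at least 35% of years to the last such day; missing
    (NaN) days count as snow-free in this sketch.
    """
    frac = np.mean(depths > 0, axis=0)    # fraction of years with snow, by day
    qualifying = np.where(frac >= threshold)[0]
    if qualifying.size == 0:
        return None                       # grid never has a winter season
    return int(qualifying[0]), int(qualifying[-1])
```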
We now describe our changepoint detection methods. Due to averaging (the central limit theorem), $\bar{X}_n$ is approximately normally distributed. Hence, the changepoint techniques for normal data from [34] apply. Trend components must be included in the changepoint detection methodology to avoid identifying spurious changepoints caused by the presence of non-zero trends ([41]). Our basic model for the yearly average snow depths $\bar{X}_n$ is the time series regression of form

$$\bar{X}_n = \kappa + \alpha n + \Delta_n + \epsilon_n, \quad n = 1, \ldots, d. \quad (1)$$

Here, $\kappa$ models the grid's average daily snow depth (less any trend), $\alpha$ is a linear trend parameter, $\Delta_n$ is a changepoint effect described further below, and $\{\epsilon_n\}$ is a zero-mean first order autoregressive (AR(1)) error process that induces temporal correlation into the yearly averages. The AR(1) errors obey $\epsilon_n = \phi \epsilon_{n-1} + Z_n$, where $\{Z_n\}$ is a zero-mean white noise process with variance $\sigma^2$, and $\phi$ is the correlation between consecutive years of average snow depths. The changepoint effect $\Delta_n$, with $k$ mean shifts at the times $\tau_1 < \cdots < \tau_k$, respectively, is

$$\Delta_n = \Delta_\ell, \quad \tau_{\ell-1} \le n < \tau_\ell, \quad \ell = 1, \ldots, k+1,$$

where $\tau_0 = 1$, $\tau_{k+1} = d + 1$, and $\Delta_1 = 0$ by convention. Here, $\Delta_\ell$ is interpreted as the changepoint effect of the $\ell$th regime, measured relative to the first regime ($\Delta_\ell - \Delta_{\ell-1}$ is the shift magnitude since the last regime).

This model contains the regression parameters $\kappa$, $\alpha$, and $\Delta_2, \ldots, \Delta_{k+1}$, the changepoint parameters $k$ and $\tau_1, \ldots, \tau_k$, and the time series parameters $\phi$ and $\sigma^2$. For a given configuration of $k$ changepoints occurring at the times $\tau_1 < \cdots < \tau_k$, the regression parameters are estimated using standard linear regression methods; Yule–Walker estimators are fitted to the linear regression residuals to estimate the time series parameters $\phi$ and $\sigma^2$ ([42] provide details).
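For concreteness, the fitting step for one candidate changepoint configuration might look as follows. This is a sketch under the model assumptions above, not the authors' code; the shift-indicator design means each fitted coefficient is the shift since the previous regime, and cumulative sums recover the effects $\Delta_\ell$ relative to the first regime.

```python
import numpy as np

def fit_regression_ar1(xbar, taus):
    """Fit model (1) for one candidate changepoint configuration.

    `xbar` holds the d yearly average depths and `taus` the (1-indexed)
    changepoint years. The regression parameters come from ordinary
    least squares; the AR(1) parameters are then Yule-Walker estimates
    computed from the regression residuals.
    """
    xbar = np.asarray(xbar, dtype=float)
    d = len(xbar)
    n = np.arange(1, d + 1)
    # Columns: intercept, linear trend, one shift indicator per changepoint.
    X = np.column_stack([np.ones(d), n] +
                        [(n >= tau).astype(float) for tau in taus])
    beta, *_ = np.linalg.lstsq(X, xbar, rcond=None)
    resid = xbar - X @ beta
    gamma0 = np.sum(resid ** 2) / d                # sample variance
    gamma1 = np.sum(resid[1:] * resid[:-1]) / d    # lag-one autocovariance
    phi = gamma1 / gamma0                          # Yule-Walker AR(1) estimate
    sigma2 = gamma0 * (1.0 - phi ** 2)             # white noise variance
    return beta, resid, phi, sigma2
```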
Estimation of the best changepoint configuration of a time series is a statistical model selection problem. Popular model selection criteria include the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC), and the Minimum Description Length (MDL). To date, MDL methods have produced superior empirical results ([43,44]). MDL methods minimize an objective function of the form

$$\text{MDL} = -\log\big(\hat{L}(k; \tau_1, \ldots, \tau_k)\big) + P$$

over all possible changepoint configurations. Here, $\hat{L}(k; \tau_1, \ldots, \tau_k)$ is an optimal model likelihood given the number of changepoints $k$ and their locations $\tau_1, \ldots, \tau_k$, and $P$, which depends on the number of changepoints and their location times, is a penalty term to prevent overfitting. Unlike AIC and BIC penalties, MDL penalties depend on where the changepoints lie. The MDL penalty is more than a multiple of the number of changepoints, penalizing closely spaced changepoint times more than sparsely spaced changepoints.
The innovations form of the multivariate Gaussian likelihood is used:

$$-2\log L = \sum_{n=1}^{d} \frac{\big( \bar{X}_n - \hat{\bar{X}}_n \big)^2}{v_n} + \sum_{n=1}^{d} \log(v_n) + d \log(2\pi). \quad (2)$$

Here, $\hat{\bar{X}}_n$ is the one-step-ahead predictor of $\bar{X}_n$ and $v_n$ is its mean squared prediction error. For an AR(1) series, $\hat{\bar{X}}_n = m_n + \phi(\bar{X}_{n-1} - m_{n-1})$ for $n \ge 2$, where $m_n = \kappa + \alpha n + \Delta_n$ is the mean of $\bar{X}_n$, and $\hat{\bar{X}}_1 = m_1$, $v_1 = \sigma^2/(1 - \phi^2)$, and $v_n = \sigma^2$ for $n \ge 2$. Parameter estimates for $\phi$ and $\sigma^2$ are calculated from the Yule–Walker equations involving the sample variance and lag one autocorrelation of the regression residuals. Substituting the estimators of all parameters into (2) gives the optimized likelihood $\hat{L}(k; \tau_1, \ldots, \tau_k)$ for the changepoint configuration with $k$ changepoints at locations $\tau_1, \ldots, \tau_k$.
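The AR(1) forms above make Equation (2) straightforward to evaluate; a minimal sketch, assuming the fitted regression means $m_n$ are supplied, is:

```python
import math

def neg2_loglik_ar1(xbar, mean, phi, sigma2):
    """-2 log Gaussian likelihood of Equation (2), in innovations form.

    `mean` holds the fitted regression means m_n. For AR(1) errors the
    one-step-ahead predictors and their mean squared prediction errors
    have the closed forms used below.
    """
    d = len(xbar)
    total = 0.0
    for n in range(d):
        if n == 0:
            pred = mean[0]
            v = sigma2 / (1.0 - phi ** 2)   # stationary AR(1) variance
        else:
            pred = mean[n] + phi * (xbar[n - 1] - mean[n - 1])
            v = sigma2                       # innovation variance
        total += (xbar[n] - pred) ** 2 / v + math.log(v)
    return total + d * math.log(2.0 * math.pi)
```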
The penalty term $P$ for the changepoint configuration is that in [35]; when $k = 0$, the penalty vanishes from the model ($P = 0$). The best changepoint model is estimated as the one that minimizes the MDL score. An exhaustive search over all changepoint configurations requires evaluation of on the order of $2^d$ MDL scores, an arduous task even on the world's fastest computers for moderately large $d$. Here, a GA was devised to perform the minimization. GAs have recently been employed for multiple changepoint detection in climate data ([34,35,44]). See [45] for more on genetic algorithms.
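A toy version of such a GA search is sketched below. Here `mdl_score` is an assumed helper that fits a candidate configuration and returns its MDL score (for example, by combining the likelihood sketch above with the penalty of [35]); production GAs refine the selection, crossover, and mutation operators considerably.

```python
import random

def ga_changepoints(xbar, mdl_score, pop=50, gens=200,
                    p_cross=0.9, p_mutate=0.02, seed=0):
    """Toy genetic algorithm for minimizing an MDL score.

    A chromosome is a 0/1 vector over the d years; a 1 in position i
    flags a changepoint at year i + 1 (year 1 itself is never a
    changepoint). `mdl_score(xbar, taus)` is an assumed helper.
    """
    d = len(xbar)
    rng = random.Random(seed)

    def taus(chrom):
        return [i + 1 for i, g in enumerate(chrom) if g and i > 0]

    def fitness(chrom):
        return mdl_score(xbar, taus(chrom))

    population = [[1 if rng.random() < 0.05 else 0 for _ in range(d)]
                  for _ in range(pop)]
    best = min(population, key=fitness)
    for _ in range(gens):
        ranked = sorted(population, key=fitness)
        best = min([best, ranked[0]], key=fitness)
        parents = ranked[:pop // 2]                  # truncation selection
        children = []
        while len(children) < pop:
            mom, dad = rng.sample(parents, 2)
            cut = rng.randrange(1, d) if rng.random() < p_cross else 0
            child = mom[:cut] + dad[cut:]            # one-point crossover
            child = [1 - g if rng.random() < p_mutate else g
                     for g in child]                 # random mutation
            children.append(child)
        population = children
    return taus(best)
```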
Panel 1 of Figure 2 depicts the number of changepoints estimated by year for our 1217 grid study region. Three peaks are seen, in 1953, 1974, and 1978. As an example, when these methods are applied to the Warm Lake, Idaho grid, changepoints are declared on 1 July of 1979, 1984, and 1988. Panel 2 of Figure 2 graphically depicts the MDL fit and a simple linear regression fit to $\bar{X}_n$. For this grid, the slope of the MDL regression structure is more negative than that of the simple linear trend fitted when changepoints are ignored; more particulars about this example are given later.
4. The Storage Model
Ref. [30] introduce a storage model to estimate trends in daily snow depths in the presence of changepoints. This model was fitted to a time series from Napoleon, North Dakota.
For each fixed grid, estimates of the number of mean shifts $k$ and their times $\tau_1, \ldots, \tau_k$ are now in place. Our model for the daily snow depths is based on the storage equation

$$X_t = \max\left( X_{t-1} + \delta_t,\, 0 \right),$$

where $\delta_t$ quantifies the random change in the snow pack from day $t-1$ to day $t$. The max term prevents the snow depths from becoming negative. Here, the depth change $\delta_t$ is assumed statistically independent of $X_{t-1}$.
We posit that $E(\delta_t)$ has a periodic component, possibly multiple changepoints, and a linear trend. The variance of $\delta_t$ is assumed periodic with period $T = 365$: $\text{Var}(\delta_t) = \sigma_\nu^2$ for $t = (n-1)T + \nu$. For computational convenience, $\delta_t$ is assumed normally distributed: $\delta_t \sim N(E(\delta_t), \sigma_\nu^2)$; other distributions can easily be used if desired. Specifically, $E(\delta_t)$ is parameterized at time $t = (n-1)T + \nu$ as

$$E(\delta_t) = \mu_\nu + \beta t + \Delta_t, \quad (3)$$

where $\mu_\nu$ is a periodic daily mean component and the mean shifts are parametrized as in the last section:

$$\Delta_t = \sum_{\ell=2}^{k+1} \Delta_\ell\, \mathbb{1}_{[\tau_{\ell-1} \le t < \tau_\ell]}.$$

The term $\mathbb{1}_{[\tau_{\ell-1} \le t < \tau_\ell]}$ is a zero/one indicator that is unity only when $\tau_{\ell-1} \le t < \tau_\ell$.
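Simulation from the storage equation is direct once $E(\delta_t)$ and $\sigma_\nu$ are specified; a minimal sketch (hypothetical function name) that generates one realization is:

```python
import numpy as np

def simulate_storage(mean_delta, sd_delta, x0=0.0, seed=None):
    """Simulate one realization of daily depths from the storage equation.

    `mean_delta[t]` and `sd_delta[t]` give E(delta_t) and its standard
    deviation for day t, with the seasonal cycle, trend, and mean shifts
    of Equation (3) already baked into `mean_delta`. The max(..., 0)
    enforces the hard zero boundary on snow depths.
    """
    rng = np.random.default_rng(seed)
    x = np.empty(len(mean_delta))
    prev = x0
    for t in range(len(mean_delta)):
        delta = rng.normal(mean_delta[t], sd_delta[t])  # daily depth change
        prev = max(prev + delta, 0.0)                   # storage equation
        x[t] = prev
    return x
```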
To estimate the storage model parameters (this is not the same task as estimating changepoint numbers and locations), we minimize the weighted sum of squared one-step-ahead prediction errors

$$D(\boldsymbol{\theta}) = \sum_{t=2}^{N} w_t \left( X_t - \hat{X}_t \right)^2.$$

Here, $\boldsymbol{\theta}$ is a vector containing all model parameters and $\hat{X}_t$ is the one-step-ahead prediction of $X_t$ from $X_{t-1}$, calculated in Woody et al. (2009) as

$$\hat{X}_t = \left( X_{t-1} + E(\delta_t) \right) \Phi\!\left( \frac{X_{t-1} + E(\delta_t)}{\sigma_\nu} \right) + \sigma_\nu\, \varphi\!\left( \frac{X_{t-1} + E(\delta_t)}{\sigma_\nu} \right), \quad (4)$$

where $\Phi$ and $\varphi$ are the cumulative distribution and density functions, respectively, of a standard normal random variable. Because snow is seasonal, weighted least squares is used, with day-specific weights $w_t$. To estimate $\sigma_\nu$, first calculate a point estimate of the mean depth change from day $\nu - 1$ to day $\nu$; the estimate of $\sigma_\nu$ is then simply a multiple of the sample standard deviation of the observed changes on day $\nu$. We smooth these daily variance estimates via the Matlab function “cfit” before use, to minimize variability.
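The censored-normal prediction in Equation (4) is also easy to compute directly; a self-contained sketch is:

```python
from math import erf, exp, pi, sqrt

def predict_next(x_prev, mean_delta_t, sd_t):
    """One-step-ahead prediction E[X_t | X_{t-1}] of Equation (4).

    With X_t = max(X_{t-1} + delta_t, 0) and a normal delta_t, the
    conditional mean is the censored-normal formula written in terms of
    the standard normal cdf (Phi) and density (phi).
    """
    m = x_prev + mean_delta_t                   # mean of X_{t-1} + delta_t
    z = m / sd_t
    Phi = 0.5 * (1.0 + erf(z / sqrt(2.0)))      # standard normal cdf
    phi = exp(-0.5 * z * z) / sqrt(2.0 * pi)    # standard normal density
    return m * Phi + sd_t * phi
```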
Some minor quality control is applied to the daily snow depths before trend inference is conducted. Following [46], if the day-to-day change in snow depths on season day $\nu$ is more than four standard deviations from the mean daily change for season day $\nu$, then the datum for that day is flagged and considered missing.
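A sketch of this four-standard-deviation screen, assuming per-day seasonal mean and standard deviation summaries are available from the fitting step, is:

```python
import numpy as np

def qc_flag(depths, mean_change, sd_change):
    """Flag implausible day-to-day depth changes as missing.

    Implements the four-standard-deviation screen described above:
    `mean_change` and `sd_change` are assumed per-day seasonal summaries
    aligned with `depths`; offending days are set to NaN.
    """
    out = np.asarray(depths, dtype=float).copy()
    change = np.diff(out)                         # day-to-day depth changes
    bad = np.abs(change - mean_change[1:]) > 4.0 * sd_change[1:]
    out[1:][bad] = np.nan                         # flag the offending days
    return out
```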
While the parameter $\beta$ controls the trend and is the object of our study, it is not easily interpretable within the storage equation. For example, because of the maximum in the storage equation, $\beta = 50$ does not imply a depth change of 50 units per unit time. A quantity that does have interpretable units of depth change per time is the linear trend statistic in the presence of changepoints:

$$\hat{\beta} = \frac{\sum_{r=1}^{k+1} \sum_{t \in D_r} (t - \bar{t}_r)(X_t - \bar{X}_r)}{\sum_{r=1}^{k+1} \sum_{t \in D_r} (t - \bar{t}_r)^2}, \quad (5)$$

where $D_r$ denotes the set of times at which the series experienced the $r$-th regime, $r = 1, \ldots, k+1$, and $\bar{X}_r$ and $\bar{t}_r$ denote the average snow depth and time, respectively, over regime $r$. See [47] or [48] for more on this statistic. In this context, $\hat{\beta}$ estimates the mean change in the snow depth per unit time (scaled to cm per century below). To obtain standard errors for $\hat{\beta}$, we simply simulate 1000 independent realizations of the storage model with parameters as estimated from the data for each grid, and then compute the sample mean and standard deviation of the 1000 trend estimates in (5). Simulation is used because explicit forms for storage equation means do not exist.
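For illustration, a sketch of the trend statistic in Equation (5) and its simulation-based standard error, reusing the storage simulator sketched earlier, is:

```python
import numpy as np

def trend_with_changepoints(x, regimes):
    """Changepoint-aware trend statistic of Equation (5).

    `regimes` is a list of integer index arrays partitioning time into
    the k + 1 regimes. Each regime is centered at its own mean so that
    mean shifts do not masquerade as trend.
    """
    num = den = 0.0
    for idx in regimes:
        t = np.asarray(idx, dtype=float)
        xr = np.asarray(x)[idx]
        num += np.sum((t - t.mean()) * (xr - xr.mean()))
        den += np.sum((t - t.mean()) ** 2)
    return num / den

def trend_standard_error(mean_delta, sd_delta, regimes, reps=1000, seed=1):
    """Simulation-based mean and standard error of the trend statistic.

    Mirrors the procedure in the text: simulate `reps` independent
    storage-model realizations at the fitted parameters (reusing the
    `simulate_storage` sketch above) and summarize the trend estimates.
    """
    trends = [trend_with_changepoints(
                  simulate_storage(mean_delta, sd_delta, seed=seed + r),
                  regimes)
              for r in range(reps)]
    return float(np.mean(trends)), float(np.std(trends, ddof=1))
```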
At Warm Lake, three changepoints are estimated, occurring on 1 July 1979, 1 July 1984, and 1 July 1988. Parameter estimates for $E(\delta_t)$ in Equation (3) were computed for this grid. The negative trend estimate $\hat{\beta}$ implies overall declining snow depths; the positive shift at the first changepoint entails an abrupt move to a snowier era from 1 July 1979 to 30 June 1984. This is followed by a modest drop in snow depths from 1 July 1984 to 29 June 1988, and finally another decrease in depths from 1 July 1988 onwards. The thousand simulations of the storage model with these parameters provide an average trend and a standard error (both in cm/century), and hence a z-score and one-sided p-value for testing the null hypothesis that $\beta = 0$. The daily snow depths from 1 July 1948 to 30 June 2009 are depicted in Panel 3 of Figure 2. A simulation of the daily snow depths using the parameter estimates from the storage model fit reported above is shown in Panel 4 of Figure 2 and bears a reasonable likeness to the raw data in Panel 3, including the estimated mean level shifts (simulation is another use of the storage model).
In contrast, if changepoints are ignored, the Equation (3) parameter estimates change accordingly, and the thousand simulations of the storage model with these parameters provide an average trend statistic, standard error, z-score, and one-sided p-value for testing $\beta = 0$ in the same fashion. Overall, snow depths are inferred to be decreasing at Warm Lake, more so when changepoints are taken into account.
5. Results
North American results are described for two cases: (1) when changepoints were ignored and (2) when changepoints were included. The study area contained 6613 grids in total; however, only 1217 grids in the study region experienced a winter season as defined above and had data meeting our quality constraints. The average number of changepoints over these 1217 grids was 0.7198, with a standard error of 0.8975. More specifically, 664 (54.56%) of the grids were changepoint free, 272 grids (22.35%) had only one changepoint, 241 grids (19.80%) had two changepoints, 38 grids (3.12%) had three changepoints, and 2 (0.16%) had four changepoints (the maximum number detected).
More grids reported negative depth trends than positive depth trends, regardless of whether or not changepoints were considered. When changepoints were ignored, 469 grids (38.54%) reported positive depth trends and 748 grids (61.46%) reported negative depth trends. When changepoints were taken into account, 537 grids (44.12%) reported positive snow depth trends and 680 grids (55.88%) reported negative snow depth trends.
The magnitude of the estimated trends was a different story: the average grid trend was 0.3212 cm/century without changepoints, with a standard error of 4.6585 cm/century, and 0.1519 cm/century, with a standard error of 9.1046 cm/century, when changepoints were included. Overall, the addition of changepoints reduced the average depth trend.
The mean trend of grids with positive trends when changepoints were ignored was 2.7735 cm/century; the mean trend of grids with decreasing trends when changepoints were ignored was −0.8551 cm/century. In contrast, when changepoints were taken into account, the mean trend of grids reporting increasing trends was 4.9067 cm/century and the mean trend of decreasing grids was −4.9698 cm/century. The magnitude of the trends was substantially larger when changepoints were considered.