For examples, we’re going to use one of the same datasets that are widely documented in the MARSS manual. This data consists of three columns, * year, representing the year of the observation * logit.s, representing logit-transformed survival of Columbia River salmon * CUI.apr, representing coastal upwelling index values in April
## This is loo version 2.8.0
## - Online documentation and vignettes at mc-stan.org/loo
## - As of v2.0.0 loo defaults to 1 core but we recommend using as many as possible. Use the 'cores' argument or set options(mc.cores = NUM_CORES) for an entire session.
## year logit.s CUI.apr
## 1 1964 -3.46 57
## 2 1965 -3.32 5
## 3 1966 -3.58 43
## 4 1967 -3.03 11
## 5 1968 -3.61 47
## 6 1969 -3.35 -21
For a first model, we’ll fit the same model used in the
MARSS
manual, with a time varying intercept and time
varying coefficient on CUI. We can specify these time varying effects
using the time_varying
argument. Note an intercept is not
included, like with lm
, glm
, and similar
packages this the intercept is implicitly included. Note that
alternatively you could specify this formula as
time_varying = logit.s ~ 1 + CUI.apr
, where we add the
extra 1
to signify the intercept.
Like the MARSS
example, we also standardize the
covariates,
SalmonSurvCUI$CUI.apr = scale(SalmonSurvCUI$CUI.apr)
fit <- fit_dlm(time_varying = logit.s ~ CUI.apr,
data = SalmonSurvCUI,
chains=1,
iter=4000)
##
## SAMPLING FOR MODEL 'dlm' NOW (CHAIN 1).
## Chain 1:
## Chain 1: Gradient evaluation took 4.5e-05 seconds
## Chain 1: 1000 transitions using 10 leapfrog steps per transition would take 0.45 seconds.
## Chain 1: Adjust your expectations accordingly!
## Chain 1:
## Chain 1:
## Chain 1: Iteration: 1 / 4000 [ 0%] (Warmup)
## Chain 1: Iteration: 400 / 4000 [ 10%] (Warmup)
## Chain 1: Iteration: 800 / 4000 [ 20%] (Warmup)
## Chain 1: Iteration: 1200 / 4000 [ 30%] (Warmup)
## Chain 1: Iteration: 1600 / 4000 [ 40%] (Warmup)
## Chain 1: Iteration: 2000 / 4000 [ 50%] (Warmup)
## Chain 1: Iteration: 2001 / 4000 [ 50%] (Sampling)
## Chain 1: Iteration: 2400 / 4000 [ 60%] (Sampling)
## Chain 1: Iteration: 2800 / 4000 [ 70%] (Sampling)
## Chain 1: Iteration: 3200 / 4000 [ 80%] (Sampling)
## Chain 1: Iteration: 3600 / 4000 [ 90%] (Sampling)
## Chain 1: Iteration: 4000 / 4000 [100%] (Sampling)
## Chain 1:
## Chain 1: Elapsed Time: 8.123 seconds (Warm-up)
## Chain 1: 7.468 seconds (Sampling)
## Chain 1: 15.591 seconds (Total)
## Chain 1:
## Warning: There were 170 divergent transitions after warmup. See
## https://mc-stan.org/misc/warnings.html#divergent-transitions-after-warmup
## to find out why this is a problem and how to eliminate them.
## Warning: There were 1 chains where the estimated Bayesian Fraction of Missing Information was low. See
## https://mc-stan.org/misc/warnings.html#bfmi-low
## Warning: Examine the pairs() plot to diagnose sampling problems
## Warning: Bulk Effective Samples Size (ESS) is too low, indicating posterior means and medians may be unreliable.
## Running the chains for more iterations may help. See
## https://mc-stan.org/misc/warnings.html#bulk-ess
## Warning: Tail Effective Samples Size (ESS) is too low, indicating posterior variances and tail quantiles may be unreliable.
## Running the chains for more iterations may help. See
## https://mc-stan.org/misc/warnings.html#tail-ess
##
## SAMPLING FOR MODEL 'dlm' NOW (CHAIN 1).
## Chain 1:
## Chain 1: Gradient evaluation took 4e-05 seconds
## Chain 1: 1000 transitions using 10 leapfrog steps per transition would take 0.4 seconds.
## Chain 1: Adjust your expectations accordingly!
## Chain 1:
## Chain 1:
## Chain 1: Iteration: 1 / 1000 [ 0%] (Warmup)
## Chain 1: Iteration: 100 / 1000 [ 10%] (Warmup)
## Chain 1: Iteration: 200 / 1000 [ 20%] (Warmup)
## Chain 1: Iteration: 300 / 1000 [ 30%] (Warmup)
## Chain 1: Iteration: 400 / 1000 [ 40%] (Warmup)
## Chain 1: Iteration: 500 / 1000 [ 50%] (Warmup)
## Chain 1: Iteration: 501 / 1000 [ 50%] (Sampling)
## Chain 1: Iteration: 600 / 1000 [ 60%] (Sampling)
## Chain 1: Iteration: 700 / 1000 [ 70%] (Sampling)
## Chain 1: Iteration: 800 / 1000 [ 80%] (Sampling)
## Chain 1: Iteration: 900 / 1000 [ 90%] (Sampling)
## Chain 1: Iteration: 1000 / 1000 [100%] (Sampling)
## Chain 1:
## Chain 1: Elapsed Time: 3.656 seconds (Warm-up)
## Chain 1: 2.445 seconds (Sampling)
## Chain 1: 6.101 seconds (Total)
## Chain 1:
## Warning: There were 12 divergent transitions after warmup. See
## https://mc-stan.org/misc/warnings.html#divergent-transitions-after-warmup
## to find out why this is a problem and how to eliminate them.
## Warning: Examine the pairs() plot to diagnose sampling problems
## Warning: The largest R-hat is 1.36, indicating chains have not mixed.
## Running the chains for more iterations may help. See
## https://mc-stan.org/misc/warnings.html#r-hat
## Warning: Bulk Effective Samples Size (ESS) is too low, indicating posterior means and medians may be unreliable.
## Running the chains for more iterations may help. See
## https://mc-stan.org/misc/warnings.html#bulk-ess
## Warning: Tail Effective Samples Size (ESS) is too low, indicating posterior variances and tail quantiles may be unreliable.
## Running the chains for more iterations may help. See
## https://mc-stan.org/misc/warnings.html#tail-ess
With only 1000 iterations and 1 chain, we might not expect the model to converge (see additional guidance via Stan developers here: https://mc-stan.org/misc/warnings.html). A couple things to consider:
adapt_delta
. (this will slow the sampling down a little).
If you’re still getting these warnings, the model may be mis-specified
or data not informative.fit <- fit_dlm(time_varying = logit.s ~ CUI.apr,
data = SalmonSurvCUI,
chains=1,
iter=1000,
control = list(adapt_delta=0.99))
max_treedepth
) depth with the
followingfit <- fit_dlm(time_varying = logit.s ~ CUI.apr,
data = SalmonSurvCUI,
chains=1,
iter=1000,
control = list(max_treedepth=15))
We can extract tidied versions of any of the parameters with
## # A tibble: 169 × 3
## term estimate std.error
## <chr> <dbl> <dbl>
## 1 eta[1] -3.50 0.333
## 2 eta[2] -3.35 0.297
## 3 eta[3] -3.54 0.281
## 4 eta[4] -3.27 0.281
## 5 eta[5] -3.62 0.314
## 6 eta[6] -3.44 0.305
## 7 eta[7] -3.99 0.298
## 8 eta[8] -4.23 0.266
## 9 eta[9] -4.66 0.270
## 10 eta[10] -5.26 0.379
## # ℹ 159 more rows
We also have a helper function for extracting and visualizing trends for time varying parameters.
## $plot
##
## $b_varying
## # A tibble: 84 × 5
## term estimate std.error par time
## <chr> <dbl> <dbl> <chr> <int>
## 1 b_varying[1,1] -3.33 0.494 (Intercept) 1
## 2 b_varying[2,1] -3.35 0.300 (Intercept) 2
## 3 b_varying[3,1] -3.32 0.337 (Intercept) 3
## 4 b_varying[4,1] -3.24 0.313 (Intercept) 4
## 5 b_varying[5,1] -3.43 0.326 (Intercept) 5
## 6 b_varying[6,1] -3.61 0.282 (Intercept) 6
## 7 b_varying[7,1] -3.89 0.284 (Intercept) 7
## 8 b_varying[8,1] -4.27 0.264 (Intercept) 8
## 9 b_varying[9,1] -4.70 0.266 (Intercept) 9
## 10 b_varying[10,1] -4.95 0.353 (Intercept) 10
## # ℹ 74 more rows
This function returns a plot and the values used to make the plot
(b_varying
)
As a second example, we could fit a model with a constant effect of
CUI, but a time varying slope. The fit_dlm
function has two
formula parameters, formula
for static effects, and
time_varying
for time varying ones. This model is specified
as
We could do the same with a model that had a time varying effect of CUI, but a time constant slope.