steve-the-bayesian / BOOM

A C++ library for Bayesian modeling, mainly through Markov chain Monte Carlo, but with a few other methods supported. BOOM = "Bayesian Object Oriented Modeling". It is also the sound your computer makes when it crashes.
GNU Lesser General Public License v2.1

Different results from BSTS models in Windows v/s Linux environments #55

Closed ankbaras closed 3 years ago

ankbaras commented 3 years ago


Hi Steve,

We are using BSTS models for a forecasting project. While the models work quite well on the local Windows systems we used to prototype the code, we noticed, on average, degraded performance when deploying them to a Linux environment (Ubuntu 20.04). To diagnose what might be going wrong, we ran a simple experiment with R's iris dataset. Attached are the results from varying the environment, the version of base R, and the version of the library itself.

[Image: table of results comparing environments, base R versions, and package versions]

Please find below the code used to arrive at the above results:

```r
library(bsts)
library(datasets)

train <- iris[1:100, 1:3]
test <- iris[101:150, 1:3]

state <- list()
state <- bsts::AddLocalLevel(state, train$Sepal.Length)
model <- bsts::bsts(Sepal.Length ~ ., state.specification = state,
                    niter = 1000, data = train, seed = 2020, ping = 0)

burn <- 100
preds <- bsts::predict.bsts(model, newdata = test, seed = 2020, burn = burn)
preds$mean

colMeans(model$coefficients[-(1:burn), ])
```

A deeper dive into the source code leads me to believe this behaviour might arise from the use of srand() to set the seed in seed_rng_from_R.cpp. In most cases the differences in results are not large, but since we are modeling on log(x) rather than x, they tend to be exaggerated when evaluating prediction accuracy on the actual data.

steve-the-bayesian commented 3 years ago

Hi Ankit. BOOM uses a modern random number generator from the C++ standard library. As far as I'm aware, that RNG guarantees the same seed will produce the same trained models, and the same predictions, as long as you stick to the same platform. Different platforms are free to implement parts of the C++ standard library as they see fit, which means you might get different streams of pseudo-random numbers on different platforms. If you are passing the 'seed' argument, then seed_rng_from_R should not be getting called. I will close this issue, but please feel free to re-open it if you think 'seed' is being ignored.