Right now regression data takes up a significant amount of space in the repo. I suppose this is not a huge issue as only developers/contributors `git clone` the repo while users can just `] add Oceananigans`.
But a potential solution would be to store regression data elsewhere and access it using DataDeps.jl. This might be especially good if we want more/larger regression tests. And it wouldn't increase the repo size every time you have to change the regression data.
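For concreteness, here is a rough sketch of what registering the regression data with DataDeps.jl might look like. The data dep name, the message, and the URL are all placeholders I made up for illustration (nothing is hosted there), and I'm assuming the data would be shipped as a single tarball:

```julia
using DataDeps

# On CI there is nobody to answer the download prompt, so accept automatically.
ENV["DATADEPS_ALWAYS_ACCEPT"] = "true"

# Hypothetical name and URL: replace with wherever the data ends up being hosted.
register(DataDep(
    "OceananigansRegressionData",
    "Regression test data for Oceananigans.jl (placeholder registration).",
    "https://example.com/oceananigans/regression_data.tar.gz";
    post_fetch_method = unpack,  # extract the tarball after it is downloaded
))

# Resolves to the local directory holding the data, downloading it on first use.
regression_data_dir = datadep"OceananigansRegressionData"
```

A real registration should probably also pass a checksum as the fourth positional argument so a corrupted or changed download gets caught. Changing the regression data would then mean uploading a new file and updating the URL and checksum rather than committing new binary files.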
We could maybe store the data on engaging? Ideally it should be hosted somewhere with near 100% uptime as we're already maintaining Buildkite, which fails sometimes, so we should try to reduce the number of possible failure points in our CI pipeline.
Copy-pasting some analysis below:
I think regression files currently take up ~17.1 MiB of space in the git repo while a fresh clone of the repo is ~43 MiB (images and convergence plots probably take up several MiB).
Another use, as mentioned by @christophernhill, is to host initial condition / state data for examples (which would only be downloaded if users need to run the examples in question?)
Here's a listing of all files in git history over 300 KiB (command from https://stackoverflow.com/a/42544963):