Open glwagner opened 2 months ago
I guess a benefit of using Scratch
is that users will only have to download huge files / datasets once per filesystem, and then they can use that data in many projects.
Another possibility is to try to do both; ie use Scratch
to avoid redownloading but also somehow document the location of the data so users can inspect.
Right, it is indeed convenient to have the data in the local directory for inspection, so a hybrid implementation might be advantageous.
For those interested, the way it's used is within Bathymetry.__init__()
:
which generates the global variable download_bathymetry_cache
(a directory) which is then used here:
Right, it is indeed convenient to have the data in the local directory for inspection, so a hybrid implementation might be advantageous.
We could just create a symbolic link in the current working directory?
I like that solution!
I'd like to start this discussion here. The README for
Scratch.jl
states:I think that at least some users will want to interact with the files that configure a ClimaOcean simulation. For example, we may be interested in inspecting the high-resolution bathymetry file that was used to generate bathymetry for a certain test case. Or atmospheric forcing data. At least, that is common for typical workflows in other packages. Possibly, with ClimaOcean this need will be reduced (we will see). But for now I think it's perhaps best to assume that users want to interact with the downloaded files, which means that we probably don't want to use
Scratch
at the moment.@simone-silvestri