Open robbibt opened 7 months ago
@SpacemanPaul
Probably best handled in the 1.9 branch after #1505 is merged.
Add skip_broken_datasets
as a config option, defaulting to False. Will automatically be over-rideable per environment with e.g. ODC_PROD_SKIP_BROKEN_DATASETS
and/or ODC_DEV_SKIP_BROKEN_DATASETS
.
Over the past few months we've been encountering intermittent GDAL access issue semi-regularly. e.g.:
This is a real pain, particularly in automated testing where a random fail can cause us to need to re-run our entire slow test suite.
datacube.load
has a handyskip_broken_datasets
param that can be used to workaround this issue. However, we don't really want to set this in every notebook/script as it adds complexity and potentially makes workflows non-reproducible.Thougts on adding support for a global environmental variable (e.g.
ODC_SKIP_BROKEN_DATASETS
etc) that could be set to force datacube to skip broken datasets, even if this wasn't set in Python code itself? This would allow us to set this in our tests, allowing the tests to be more robust to these issues without impacting user code.