Add option to skip downloading output from S3 to local for AWS runs

openml / automlbenchmark

OpenML AutoML Benchmarking Framework

MIT License

391 stars 130 forks

Would welcome a PR. I propose to make this configurable by adding a parameter to the aws namespace in the configuration (e.g., aws.download). Ideally it would support three options:

All: download all files (the current behavior; this should remain the default)
Results: download only the result files (the _download_results function already identifies those internally, so filtering should be easy)
None: don't download any files
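
To make the three options concrete, here is a minimal sketch of how the setting could be parsed and applied to a list of S3 keys. Everything in it is hypothetical and not taken from the framework: the DownloadMode enum, parse_download_mode, select_keys, and in particular is_result_file, which assumes a result file is one named results.csv. In the real code, whatever _download_results already uses to identify result files would take its place.

```python
from enum import Enum
from typing import Callable, Iterable, List


class DownloadMode(Enum):
    """Hypothetical values for the proposed aws.download setting."""
    ALL = "all"          # current behavior, proposed default
    RESULTS = "results"  # only result files
    NONE = "none"        # nothing is downloaded


def parse_download_mode(value: str = "all") -> DownloadMode:
    # Case-insensitive so that "All", "all" and "ALL" are all accepted.
    # An unknown value raises ValueError.
    return DownloadMode(value.strip().lower())


def is_result_file(key: str) -> bool:
    # Assumption for illustration only: a result file is named "results.csv".
    return key.rsplit("/", 1)[-1] == "results.csv"


def select_keys(
    keys: Iterable[str],
    mode: DownloadMode,
    is_result: Callable[[str], bool] = is_result_file,
) -> List[str]:
    """Return the subset of S3 keys that should be downloaded for this mode."""
    if mode is DownloadMode.ALL:
        return list(keys)
    if mode is DownloadMode.RESULTS:
        return [key for key in keys if is_result(key)]
    return []  # DownloadMode.NONE
```

Defaulting to "all" keeps existing setups unchanged, and rejecting unknown values early avoids silently skipping downloads because of a typo in the configuration.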

I don't know off the top of my head whether downloading the results file is always required for the remainder of the logic to work (for example, to know whether a task has finished). If it is, then None could simply not save the file to disk (or, if there is a non-invasive way to let the task finish without downloading the file, that would work too).
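
If the results file does turn out to be needed to detect completion, one possible shape for the "don't save it to disk" variant is to read the object into an in-memory buffer. The sketch below is an assumption, not the framework's code: task_finished and the fetch callable are hypothetical, and fetch is assumed to write the object's bytes into a file-like object and raise FileNotFoundError when the key does not exist. With boto3, a thin wrapper around the S3 client's download_fileobj could play that role.

```python
import io
from typing import BinaryIO, Callable


def task_finished(key: str, fetch: Callable[[str, BinaryIO], None]) -> bool:
    """Check for a non-empty results file without writing anything to disk.

    `fetch` is a hypothetical callable that writes the object stored under
    `key` into the given file-like object and raises FileNotFoundError if
    the object does not exist.
    """
    buffer = io.BytesIO()  # lives in memory only, discarded on return
    try:
        fetch(key, buffer)
    except FileNotFoundError:
        return False
    return buffer.getbuffer().nbytes > 0
```

This still transfers the file over the network, so it only addresses local disk usage; skipping the transfer entirely would need a different completion signal.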

Add option to skip downloading output from S3 to local for AWS runs #578