exasol / ai-lab

Development environment for data science developers
MIT License

Spike: Long-term solution for cloud storage extension dataset #248

Open Shmuma opened 6 months ago

Shmuma commented 6 months ago

Around the beginning of March, the Y8M dataset we used for the cloud storage extension notebook was closed. I created a ticket for the dataset provider: https://github.com/aws-samples/data-lake-as-code/issues/28 and a ticket for replacing the dataset: https://github.com/exasol/ai-lab/issues/247

But we need a long-term solution for such situations. Hosting the data ourselves as a public dataset is risky, because it could be exploited for a "payment attack": someone requests the data over and over, and our AWS bill grows without limit.

Possible solutions we discussed with @redcatbear and @tkilias:

Those options require deeper investigation.