roboyoshi / datacurator-filetree

a standard filetree for /r/datacurator [ and r/datahoarder ]
https://reddit.com/r/datacurator
MIT License
1.47k stars 136 forks source link

Add dataset section for leaks (wikileaks, jfk files etc.) #26

Closed SebRut closed 4 years ago

roboyoshi commented 5 years ago

Good point, will probably check the-eye and other sources to find a good base directory. for now I'd consider them dumps of the www (so /root/web/)

roboyoshi commented 4 years ago

Another update here: Other datasets like: DB Dumps / CSV Files, Image Collection (CIFAR-10/CIFAR-100) are probably also included here. Its one of the few I'd consider worthy of another top-level folder..

roboyoshi commented 4 years ago

Not sure why I never closed this, but news (incl. leaks) as well as datasets now go into root/archives.