GlobalMaksimum / sadedegel

A General Purpose NLP library for Turkish
http://sadedegel.ai
MIT License
93 stars 15 forks source link

Remove datasets requiring access key #321

Open askarbozcan opened 1 year ago

askarbozcan commented 1 year ago

Most of datasets are hosted on S3, requiring access key and secret code making them inaccessible for pretty much all users. ALL data if provided as part of Sadedegel should be open. Issues with "private" datasets are as such: 1) Part of Sadedegel becomes inaccessible as most users won't bother asking for the secret code. 2) Sadedegel is a library first and this part should be emphasized.

dafajon commented 1 year ago

I addressed that on this PR. https://github.com/GlobalMaksimum/sadedegel/pull/315