Closed henrykironde closed 3 years ago
Data-One Another good site to find datasets.
Kaggle required logging in to download data the last time I looked, which won't work for our current tooling. If they've implemented API key based access (something they were talking about the last time I talked with them) then we could talk about adding functionality to allow users to specify an API key for datasets that require it.
Kaggle has a very solid, cli based api for downloading and uploading datasets. The api requires an access_token
in the ~/.kaggle folder. User could allow retriever package
to read that access_token
and accordingly perform the required actions or we could take the key as a parameter.
@DumbMachine - ah that's great! I'm :+1: for adding support for reading the api key and then adding the Kaggle datasets. When adding this support we'll need messaging to users that indicates that they need to get the api key if they try to download a kaggle dataset and don't have one. Would you be up for opening an issue describing this feature addition?
@ethanwhite I would love to open the issue and describe it further. I'll do it soon.
Added to the latest data collection # 1575
Feel free to edit and add more dataset links or sources.