nomic-ai / gpt4all-datalake

API to the GPT4All Datalake
Apache License 2.0

A question about data access #6

Open alexey-milovidov opened 1 year ago

alexey-milovidov commented 1 year ago

I'm trying to download the data, but the command

$ aws s3 sync s3://gpt4all-datalake ./datalake_dump

returns an error:

fatal error: An error occurred (AccessDenied) when calling the ListObjectsV2 operation: Access Denied

AndriyMulyar commented 1 year ago

Working on this; the S3 bucket is misconfigured. If you want a dump, ask in the Discord and we will send it to you.

The latest data dump is located at https://atlas.nomic.ai/map/gpt4all-datalake

typoworx-de commented 1 year ago

How large is the data dump overall, @AndriyMulyar? Can I also download it from https://atlas.nomic.ai/map/gpt4all-datalake? I didn't find any way to download the dump there. Is it only viewable on that page, without a full-dump download?

xnought commented 1 year ago

Are there any plans to make the download process easier? (I still get the error the original poster had)

AndriyMulyar commented 9 months ago

@xnought The bucket is configured for public export now.

Will be updating with an easier method for download soon.
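
For readers retrying the original command: if the bucket now permits anonymous listing and reads (an assumption; the thread only says it is "configured for public export"), one thing to try is sending unsigned requests with the AWS CLI's global `--no-sign-request` option. The sketch below wraps that in a helper. The name `sync_datalake` and the `DRY_RUN` switch are hypothetical conveniences for illustration, not part of any GPT4All or AWS tooling.

```shell
# Bucket name taken from the original post.
BUCKET="s3://gpt4all-datalake"

# sync_datalake: hypothetical helper, not part of any official tool.
# Assumes the bucket policy allows anonymous list and get requests.
# With DRY_RUN=1 it only prints the command instead of running it.
sync_datalake() {
  dest="${1:-./datalake_dump}"
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "aws s3 sync --no-sign-request $BUCKET $dest"
  else
    # --no-sign-request skips credential signing, so the request is anonymous.
    aws s3 sync --no-sign-request "$BUCKET" "$dest"
  fi
}

# Print the command without touching the network:
DRY_RUN=1 sync_datalake ./datalake_dump
```

Calling `sync_datalake ./datalake_dump` without `DRY_RUN` runs the actual sync. If it still fails with AccessDenied on ListObjectsV2, anonymous listing is probably not enabled on the bucket and the Discord route mentioned earlier remains the fallback.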

secretyjc commented 1 month ago

Any update?