project-baize / baize-chatbot

Let ChatGPT teach your own chatbot in hours with a single GPU!
https://arxiv.org/abs/2304.01196
GNU General Public License v3.0
3.16k stars 282 forks source link

Dutch Quora, Stack Overflow, Alpaca dataset released #34

Closed BramVanroy closed 1 year ago

BramVanroy commented 1 year ago

Hello

I saw that you released your dataset for everyone to use, so I translated it with OpenAI's model and released it on the HF Hub. I hope it helps others who want to work on Dutch.

You can find the Quora chat set and the Stack Overflow dataset in Dutch. I've also translated the Alpaca Cleaned dataset into Dutch and also converted it into the Baize format. Feel free to add it here or anywhere with a reference to the repository.

Best

Bram

guoday commented 1 year ago

Thanks Bram.

JetRunner commented 1 year ago

We just added a dedicated section in README for community efforts: https://github.com/project-baize/baize-chatbot/commit/24d15161485b2b8a6b33c637f21d5ae67413ba99

BramVanroy commented 1 year ago

Great, thanks!