project-baize / baize-chatbot

Let ChatGPT teach your own chatbot in hours with a single GPU!
https://arxiv.org/abs/2304.01196
GNU General Public License v3.0
3.16k stars 282 forks source link

How to generate chat data with FAQ dataset #33

Closed xueyouluo closed 1 year ago

xueyouluo commented 1 year ago

Hi, I want to use this code to finetune my own FAQ dataset and make it possible to answer FAQ. From the collect.py, it seems you only use the question to generate the chat dataset.

Do you have any suggestions on how to generate chat dataset with FAQ dataset? is it possible to give Chatgpt the topic(question) and answer, and let it generate chat?

thanks

xueyouluo commented 1 year ago

Oh, I can make it just like alpaca_chat_data