kbressem / medAlpaca

LLM finetuned for medical question answering
GNU General Public License v3.0
491 stars 58 forks source link

Question on ChatDoctor data #44

Open XZhang97666 opened 1 year ago

XZhang97666 commented 1 year ago

I saw there are 10000 data points used from ChatDoctor project. Which specific subset you are used?

kbressem commented 1 year ago

Just the first 10k of the dataset. However, the data has been updated by the chatdoctor authors in the meantime so this would no longer work.