epfLLM / meditron

Meditron is a suite of open-source medical Large Language Models (LLMs).
https://huggingface.co/epfl-llm
Apache License 2.0

Data preparation not working #14

Open fblgit opened 7 months ago

fblgit commented 7 months ago

Overall, this part doesn't work. The scripts seem to have wrong paths, and there are problems with the Selenium part, etc.

If you sort it out, we'll make a UNA version of your model. I hope it can help with your research.

bluesky333 commented 7 months ago

I also encountered the same issue.

th789 commented 5 months ago

I'm encountering a similar problem (also related to paths) when running `python finetuning/sft.py --checkpoint=meditron --size=7 --run_name=pubmedqa` following the instructions here. Are there any updates? Thank you!