How to mine fine-tuning samples from specified corpora

mshumer / gpt-llm-trainer

MIT License

3.92k stars 503 forks source link

How to mine fine-tuning samples from specified corpora #25

Open glacierck opened 5 months ago

glacierck commented 5 months ago

How to expand the system to limit the generation of fine-tuning samples based on a given set of corpus documents, rather than blindly fabricating them。 For example, generating fine-tuning samples for disease diagnosis, I hope it is based on the case in the uploaded real diagnosis report