Closed l-k-11235 closed 3 months ago
This PR enables multiple "response patterns" in LLM-like training examples (with prompt and response) for multi-task finetuning.
This PR enables multiple "response patterns" in LLM-like training examples (with prompt and response) for multi-task finetuning.