jina-ai / jerboa

LLM finetuning
Apache License 2.0
41 stars 4 forks source link

Add red pajamas instruct dataset to our pipeline #49

Closed alaeddine-13 closed 1 year ago

alaeddine-13 commented 1 year ago

After the release of redpajamas 7b, the instruction training dataset was released as well. Therefore we can integrate the new dataset in our current pipeline