jina-ai / jerboa

LLM finetuning
Apache License 2.0
42 stars 4 forks source link

Pipeline training dataset refactoring #51

Closed alaeddine-13 closed 1 year ago

alaeddine-13 commented 1 year ago

Currently, our training pipeline supports only 1 dataset that conform to the alpaca format by using the dataset name. We need to ensure that: