bigcode-project / starcoder

Home of StarCoder: fine-tuning & inference!

Which model is the bigcode/starcoder model trained on? #121

Closed · HIT-cwh closed this issue 1 year ago

HIT-cwh commented 1 year ago

Thank you for your valuable open-source contribution!

The README provides instructions on how to fine-tune the pre-trained bigcode/starcoder model for downstream tasks. If I want to train a StarCoder-style model starting from a different base language model, such as Llama-2-13B or Llama-2-7B, can I simply replace --model_path="bigcode/starcoder" in the command with --model_path="meta-llama/Llama-2-13b-hf" (see the sketch below)? Would this result in a model with performance similar to bigcode/starcoder?
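To be concrete, the change I have in mind looks roughly like this (the finetune/finetune.py entry point and the dataset placeholder are my assumptions, not a tested command):

```bash
# Hypothetical invocation: swap only the model path in the repo's
# fine-tuning command; all other flags left at whatever the README uses.
python finetune/finetune.py \
  --model_path="meta-llama/Llama-2-13b-hf" \
  --dataset_name="<your-instruction-dataset>"
```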

ArmelRandy commented 1 year ago

Hi. This script was not used for the pre-training of StarCoder. StarCoder was pre-trained on a vast amount of code (The Stack); the training data is available here. The script in this repo is designed for instruction fine-tuning, and it is written specifically for StarCoder, so using another model may require some modifications (here, for example). Passing --model_path meta-llama/Llama-2-13b-hf would instruction-fine-tune Llama 2 on the dataset you pass with --dataset_name. If that dataset is mostly coding instructions, you are likely to get a better coding assistant by starting from StarCoder than from Llama-2-13B.
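For illustration only, here is a minimal sketch (not the repo's actual code) of why swapping the model path largely works through the generic Hugging Face APIs, and of the kind of model-specific adjustment that may be needed; the pad-token handling below is an assumption about Llama 2's tokenizer, not something this script is guaranteed to do:

```python
# Minimal sketch: loading an alternative base model with the same generic
# Hugging Face classes an instruction fine-tuning script typically relies on.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "meta-llama/Llama-2-13b-hf"  # instead of "bigcode/starcoder"

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path)

# Llama 2's tokenizer ships without a pad token, so batching/padding logic
# written with StarCoder in mind may need an adjustment along these lines.
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
```

Beyond loading, any StarCoder-specific assumptions in the script (special tokens, prompt formatting) would need to be checked against the new model's tokenizer.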