bigscience-workshop / multilingual-modeling

BLOOM+1: Adapting BLOOM model to support a new unseen language
https://arxiv.org/abs/2212.09535
Apache License 2.0
69 stars 15 forks source link

Exp-001: Finetune gpt-2 model with new tokenizer on fr #3

Closed yongzx closed 2 years ago

yongzx commented 2 years ago

Added the following files to exp-001 folder.

Let me know if I should change the folder name – I like to use numbering for experimental runs and then indicate in the README.md file about the descriptions of the folder.

Will include the training result (log file) and checkpoints once they are ready. #2

yongzx commented 2 years ago

Added

@hadyelsahar I have requested access to the org https://huggingface.co/bigscience to use the bigscience/gpt2-350m-en checkpoint.