Open jmarsil opened 4 years ago
I was able to find the s3 bucket locations of the pretrained GPT2 models here: https://github.com/huggingface/transformers/blob/master/transformers/modeling_gpt2.py (provided by HuggingFace).
To make this work, just download gpt2-xl model instead:
curl --output gpt2-pytorch_model.bin https://s3.amazonaws.com/models.huggingface.co/bert/gpt2-xl-pytorch_model.bin
@jasonzhou1 I only get gibberish output with the XL model, worse than the small version. Have you actually had any luck with it?
Update: also tried the other models linked to in the script you referenced, also without luck.
@jasonzhou1 I only get gibberish output with the XL model, worse than the small version. Have you actually had any luck with it?
Before you try gpt-2-ml model,some parameters in gpt-2-Pytorch/GPT2/config.py should be modified , like n-heads=25 , n_embd=1600 , n_layer=25, or you can see details here https://s3.amazonaws.com/models.huggingface.co/bert/gpt2-xl-config.json
Any help would be greatly appreciated!