-
!pip install transformers datasets
from transformers import GPT2Tokenizer, GPT2LMHeadModel, Trainer, TrainingArguments
from datasets import load_dataset, load_metric
from transformers import GPT2LMH…
-
## Bug Description
Configuration : llm_examples_main branch, current torch version : 2.4, transformers==4.41.2
Error message:
```py
File "/home/dperi/Downloads/TensorRT/examples/dynamo/torch_e…
-
When I am reproducing your paper, the parameters in the SMD dataset's metrics usage script are much different from the metrics in the paper, and I would like to get the best parameters from your paper…
-
python lm_eval/\_\_main\_\_.py --model hf --model_args pretrained=openai-community/gpt2 --tasks lambada_openai --device cuda:0 --batch_size 4
| Tasks |Version|Filter|n-shot| Metric…
-
If you are submitting a bug report, please fill in the following details and use the tag [bug].
**Describe the bug**
Gemma-2-{size} is not loadable using from_pretrained. I checked OFFICIAL_MODEL_…
-
hello, haohe. I really appreciate your work! Thank you for your kindness of open sourcing.
In the learning of the training code, I can not find the training of GPT2. In the original paper, embedding…
CJ416 updated
2 weeks ago
-
I am following the tutorial from Andrej K. building gpt2 from scratch. I thought it would be a good idea to visualize his GPT2 model using torchexplorer.
This what I did:
1. install torchexplore…
-
run `bash run_gpt2.sh` and raise the value error above.
-
Hi,
Thank you for releasing the Arena. Which model is `gpt2-chatbot`?
Thanks!
-
Hi @Liuhong99 ,
I am a big fan of sophia used it cited it everytime. Just thought of suggesting you a new and less resource intensive experiment.
a) Karpathy updated the nano_gpt2 training [cod…