-
Hi, I am trying to benchmark GPT2-large and encountered `RuntimeError: The size of tensor a (1024) must match the size of tensor b (1025) at non-singleton dimension 3`.
The inputs should be able to accep…
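
One possible reading of the 1024 vs. 1025 mismatch is that the input is one token longer than GPT-2's 1024-position context window. Under that assumption (the excerpt does not confirm the cause), a minimal sketch of clipping the input before the forward pass could look like this. `MAX_POSITIONS` and `truncate_to_context` are hypothetical names used only for illustration.

```python
# Assumption: the model accepts at most 1024 positions (GPT-2's context size).
MAX_POSITIONS = 1024


def truncate_to_context(token_ids, max_positions=MAX_POSITIONS):
    """Keep only the most recent `max_positions` token ids."""
    if max_positions <= 0:
        raise ValueError("max_positions must be positive")
    return list(token_ids[-max_positions:])


ids = list(range(1025))             # a 1025-token input, as in the error message
clipped = truncate_to_context(ids)
print(len(clipped))                 # 1024
```

Keeping the tail rather than the head is a design choice that suits generation, where the most recent tokens matter most; a benchmark may prefer to cap the prompt length up front instead.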
-
Is GPT2 supported, for example [GPT2-Chinese](https://github.com/Morizeyao/GPT2-Chinese)? Could you provide a corresponding example? Thanks.
-
The main issue with this error is related to network connectivity. Specifically, the problem occurred when the program attempted to download the required file (`/gpt2/resolve/main/config.json`) from t…
-
1. There is no MoE inference example in the examples. The https://www.deepspeed.ai/tutorials/mixture-of-experts-inference/ blog provides a link to generate_text.sh, but it's a normal GPT2 mod…
-
Environment:
- System: Ubuntu 22.04.2 LTS
- CUDA Version: cuda_12.1.r12.1/compiler.32688072_0
- nvcc: 12.1
I encounter an error when I execute:
```bash
make train_gpt2cu
```
Warning and …
-
Traceback (most recent call last):
File "/root/llm.c/train_gpt2.py", line 663, in <module>
model = GPT.from_pretrained(args.model)
File "/root/llm.c/train_gpt2.py", line 210, in from_pretrained
…
-
### System Info
Hello,
It is my understanding that the GPT-2 tokenizer, obtained with `AutoTokenizer.from_pretrained("gpt2")`, should be invertible. That is, given a sentence `text`, we should h…
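
The invertibility property described here can be phrased as a round-trip check. The sketch below is only an illustration: `roundtrip_failures` is a hypothetical helper, and the toy UTF-8 byte codec is a stand-in, not the GPT-2 tokenizer.

```python
def roundtrip_failures(encode, decode, texts):
    """Return the texts for which decode(encode(text)) != text."""
    return [t for t in texts if decode(encode(t)) != t]


# Toy stand-in codec (hypothetical, NOT the GPT-2 tokenizer):
# the "token ids" are simply the UTF-8 bytes of the text.
def toy_encode(text):
    return list(text.encode("utf-8"))


def toy_decode(ids):
    return bytes(ids).decode("utf-8")


samples = ["Hello, world!", "  leading spaces", "naïve café"]
failures = roundtrip_failures(toy_encode, toy_decode, samples)
print(failures)  # [] because the byte codec is lossless
```

With the tokenizer from the report, one could pass `tokenizer.encode` and `tokenizer.decode` in place of the toy functions. Whether special tokens or decode-time cleanup affect the result depends on the tokenizer's settings.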
-
**The bug**
It appears that the latest versions of `transformers` (4.43.*) do not play nicely with GPT2 on MacOS-ARM. This is seen in our PR Gate, with errors:
```
FAILED tests/model_integration/…
-
* [The Illustrated GPT-2 (Visualizing Transformer Language Models) – Jay Alammar – Visualizing machine learning one concept at a time.](https://jalammar.github.io/illustrated-gpt2/)
* [完全图解GPT-2:看完…
-
Hello! I find that in npu-only mode the output size is set to 1, but I want to use this simulator to generate more than one token. I use stats.tsv in the share-gpt folder as cli_config (it contains two …