gpt2 Search Results - Githubissues

1000+ results
for gpt2


intel-analytics/ipex-llm #11951

[All-in-one benchmark] [GPT2-large] The size of tensor a (10…

Hi, I am trying to benchmark GPT2-large and encountered RuntimeError: The size of tensor a (1024) must match the size of tensor b (1025) at non-singleton dimension 3. The inputs should be able to accep…

Kpeacef updated 1 week ago
1
Tencent/TurboTransformers #80

GPT2

Is GPT2 supported, for example [GPT2-Chinese](https://github.com/Morizeyao/GPT2-Chinese)? Could you provide a corresponding example? Thanks.

kawa23 updated 4 years ago
2
HarliWu/FedBiOT #2

connection timeout about Hugging Face server

The main issue with this error is related to network connectivity. Specifically, the problem occurred when the program attempted to download the required file (`/gpt2/resolve/main/config.json`) from t…

gf457832386 updated 2 days ago
2
microsoft/Megatron-DeepSpeed #364

Bugs in GPT2 Inference Example

1. There is no MoE inference example in the examples, even though the https://www.deepspeed.ai/tutorials/mixture-of-experts-inference/ blog provides the link to generate_text.sh, but it's a normal GPT2 mod…

JianzheXiao updated 3 months ago
3
karpathy/llm.c #359

Error: make: *** [Makefile:203: train_gpt2cu] Error 255

Environment: - System: Ubuntu 22.04.2 LTS - CUDA Version: cuda_12.1.r12.1/compiler.32688072_0 - nvcc: 12.1 I encounter an error when I execute: ```bash make train_gpt2cu ``` Warning and …

yushengsu-thu updated 2 weeks ago
7
karpathy/llm.c #723

TypeError: normal_() got an unexpected keyword argument 'gen…

Traceback (most recent call last): File "/root/llm.c/train_gpt2.py", line 663, in model = GPT.from_pretrained(args.model) File "/root/llm.c/train_gpt2.py", line 210, in from_pretrained …

StarHtimE updated 1 month ago
1
huggingface/transformers #31884

[BUG] GPT-2 tokenizer is NOT invertible

### System Info Hello, It is my understanding that the gpt-2 tokenizer, obtained with `AutoTokenizer.from_pretrained("gpt2")`, should be invertible. That is, given a sentence `text`, we should h…

jdeschena updated 6 days ago
14
guidance-ai/guidance #965

[Bug] Latest Transformers disagrees with GPT2 on MacOS-ARM

**The bug** It appears that the latest versions of `transformers` (4.43.*) do not play nicely with GPT2 on MacOS-ARM. This is seen in our PR Gate, with errors: ``` FAILED tests/model_integration/…

riedgar-ms updated 1 month ago
2
daviddwlee84/DeepLearningPractice #13

GPT2 Collection

* [The Illustrated GPT-2 (Visualizing Transformer Language Models) – Jay Alammar – Visualizing machine learning one concept at a time.](https://jalammar.github.io/illustrated-gpt2/) * [完全图解GPT-2：看完…

daviddwlee84 updated 3 years ago
1
casys-kaist/NeuPIMs #2

How to use this simulator to generate multiple output token…

Hello! I find that in npu-only mode, the output size is set to 1, but I want to use this simulator to generate more than one token. I use stats.tsv in the share-gpt folder as cli_config (it contains two …

lhpp1314 updated 2 days ago
1
