-
When passing `multi_gpu=True` to `load_gpt2`, it appears to have no effect whatsoever on the speed of generation using `gpt2.generate` with a fine-tuned 117M model, for any number of `nsamples`.
Does…
-
If I run `python train.py --config conf/tutorial-gpt2-micro.yaml`, I get:
```
FileNotFoundError: [Errno 2] No such file or directory: '/scr/dlwh/runs/gpt2-small-d=dlwh/wikitext_103_detokenized-n=-1-g=-1…
```
dlwh updated 4 months ago
-
```
root@5dac227a29e8:~# LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD /usr/local/go/bin/go run /root/go-ggml-transformers.cpp/examples/main.go -m "/models/pythia-70m-q4_0.bin" -t 14
gpt2_model_load: loadi…
-
```
Traceback (most recent call last):
  File "D:\desktop\GPT2-chitchat-master\train.py", line 427, in <module>
    main()
  File "D:\desktop\GPT2-chitchat-master\train.py", line 423, in main
    train(model,…
```
-
I can't find the SFT for student models code targeting Qwen.
-
When I try to train the NLG model on multiple GPUs, I use this:
```
python -m torch.distributed.launch --nproc_per_node=2 --use_env src/gpt2_ft.py \
--train_data ./data/e2e/train.jsonl \
--valid_d…
-
### Here is a little example:
multiplications where one operand is a power of 2 and a constant integer, are optimized with a shift operation and the shift amount is calculated using the logBase2 of…
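A minimal sketch of the optimization described above, in plain Python rather than compiler IR; `strength_reduce_mul` is a hypothetical helper name used only for illustration, assuming non-negative integer constants:

```python
def strength_reduce_mul(x: int, c: int) -> int:
    """Multiply x by a constant c, replacing the multiply with a shift
    when c is a power of two (the shift amount is log2(c))."""
    # Power-of-two test: c has exactly one bit set.
    if c > 0 and (c & (c - 1)) == 0:
        shift = c.bit_length() - 1  # integer log2 of c
        return x << shift           # x * c rewritten as x << log2(c)
    return x * c                    # fall back to an ordinary multiply

# 7 * 8 becomes 7 << 3, since log2(8) == 3
assert strength_reduce_mul(7, 8) == 56
```

A real compiler performs this rewrite on the IR at compile time; the runtime check here only mimics the pattern-matching condition.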
-
Otherwise the generated Python code is too large, with the model parameters defined explicitly.
Example with the GPT-2 model:
```python
import torch
import transformers
import onnxscript
model = transform…
-
After spending quite a bit of time and a chunk of my resources, the code suddenly halted just to tell me that I need to "wait for 2.75s", and there is no option to continue the research. The exception is…
-
Hi, thanks for your nice work. I've downloaded the repo and tried to reproduce the results from the paper, but I got different results on the Hallucination dataset with GPT2-XL. I'm running with …