gpt2 Search Results - Githubissues

1000+ results
for gpt2

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

karpathy/nanoGPT #138

gpt2-xl

Can we train gpt2-xl on nanoGPT? If possible，where's its datasets?

zscwind updated 1 year ago
1
lm-sys/FastChat #3296

What is gpt2-chatbot

Hi, Thank you for releasing the Arena. Which model is `gpt2-chatbot`? Thanks!

fakerybakery updated 6 months ago
6
balisujohn/tortoise.cpp #18

Optimize GPT2 inference: Remove redundant `autoregressive_la…

Thank you for this excellent implementation. I'd like to suggest an optimization that could significantly speed up inference and enable streaming output. Currently, there are two GPT2 graphs: 1.…

candlewill updated 2 weeks ago
3
microsoft/Megatron-DeepSpeed #329

how to convert deepspeed model to megatron, when pp=2, tp=2,…

i train the model using 2 nodes, and copy machine1's model files to the machine2's directory. then i use python deepspeed_to_megatron.py --input_folder $checkpoint --output_folder output --tar…

lonelydancer updated 1 month ago
2
iree-org/iree-turbine #117

can not use torch.compile with simple gpt2 test

```python import torch from transformers import AutoTokenizer, AutoModelForCausalLM @torch.compile(backend="turbine_cpu") def test_gpt2_demo(): tokenizer = AutoTokenizer.from_pretrained("gp…

vivekvpandya updated 2 months ago
2
microsoft/Megatron-DeepSpeed #364

Bugs in GPT2 Inference Example

1. There is no Moe Inference example in Example, even though the https://www.deepspeed.ai/tutorials/mixture-of-experts-inference/ blog provides the link to generate_text.sh, but it's a normal GPT2 mod…

JianzheXiao updated 5 months ago
3
Pawandeep-prog/finetuned-gpt2-convai #1

Fine tuning gpt2

NameError: name 'ChatData' is not defined code below: from transformers import GPT2LMHeadModel, GPT2Tokenizer #from ChatData import ChatData from torch.optim import Adam from torch.utils.da…

MordhwajSinghYadav updated 10 months ago
1
pytorch/benchmark #2543

How to get benchmark statistics?

I'm building a CI to test some models on certain types of devices. I want get benchmark statistics like which model cases failed? which tests were skipped and why? These statistics will be used to gen…

shink updated 1 week ago
4
keras-team/keras-hub #1158

Error while using Custom preprocessor

In @chenmoneygithub scipy talk, the KerasNLP uses a custom preprocessor ``` custom_preprocessor = keras_nlp.models.GPT2CausalLMPreprocessor.from_preset( "gpt2_base_en", sequence_length=8…

shivance updated 2 days ago
2
keras-team/keras-hub #792

Benchmark GPT2 `generate` method

We have added a `generate()` method to `GPT2CausalLM`, and we need a way to benchmark this API since performance is a key to text generation. More details will be added soon.

chenmoneygithub updated 7 months ago
1

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for gpt2

1000+ results
for gpt2