-
## 🐛 Bug
I launch training with the following script:
```
CUDA_VISIBLE_DEVICES="0, 1, 2, 3" metaseq-train --task streaming_language_modeling \
data/pile-test/ \
--num-workers 4 \
--reset-dataloader \
--vo…
```
-
# Ask a Question
How should the input be formatted when we want to run prediction with a batch size greater than 1?
### Question
I am using this code:
texts = ["Here is some text to encode :…
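Batched prediction with a decoder-only model requires padding every sequence in the batch to the same length and masking out the pad positions. Below is a minimal, library-free sketch of that idea (the `pad_batch` helper is hypothetical, not an API from the project above); left-padding is used because generation continues from the right edge of each sequence.

```python
# Illustrative only: pad a batch of token-id sequences to a common length
# so a decoder-only model (e.g. GPT-2) can run on batch_size > 1.
# Left-padding keeps the "live" tokens at the right edge for generation.

def pad_batch(sequences, pad_id=0):
    """Left-pad variable-length id lists; return padded ids and an attention mask."""
    max_len = max(len(seq) for seq in sequences)
    input_ids, attention_mask = [], []
    for seq in sequences:
        n_pad = max_len - len(seq)
        input_ids.append([pad_id] * n_pad + list(seq))        # pad on the left
        attention_mask.append([0] * n_pad + [1] * len(seq))   # 0 = ignore pad
    return input_ids, attention_mask

ids, mask = pad_batch([[5, 6, 7], [9]], pad_id=0)
# ids  == [[5, 6, 7], [0, 0, 9]]
# mask == [[1, 1, 1], [0, 0, 1]]
```

The resulting `input_ids` and `attention_mask` pair is the shape most tokenizer/model stacks expect for a batched forward pass.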
-
Hello! I find that in npu-only mode the output size is fixed at 1, but I want to use this simulator to generate more than one token. I use stats.tsv in the share-gpt folder as cli_config (it contains two …
-
I tried to quantize the GPT-2 model (add QDQ layers):
```
batch_size = 8
with QATCalibrate(method="histogram", percentile=99.999) as qat:
    model_q = self.model.cuda()
    …
```
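For context, a QDQ (quantize-dequantize) layer simulates integer quantization during training so the model learns to tolerate the rounding error. The sketch below is a generic, library-free illustration of that mechanism; `fake_quantize` and its parameters are hypothetical and are not the `QATCalibrate` API used above.

```python
# Minimal sketch of what a QDQ pair does: map floats to int8 and back,
# so downstream computation sees quantization error. Symmetric per-tensor
# scaling is used here; real calibrators (histogram/percentile) pick the
# clipping range more carefully.

def fake_quantize(values, num_bits=8):
    """Symmetric per-tensor fake quantization of a list of floats."""
    qmax = 2 ** (num_bits - 1) - 1                      # 127 for int8
    scale = (max(abs(v) for v in values) / qmax) or 1.0  # avoid divide-by-zero
    quantized = [max(-qmax - 1, min(qmax, round(v / scale))) for v in values]
    return [q * scale for q in quantized]                # dequantize to float

dq = fake_quantize([1.0, -0.5, 0.25])
# each value is recovered to within half a quantization step (scale / 2)
```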
-
## Description
GPT2 tests in tests/test_models.py are mysteriously killed. This was found in the recent [nightly tests](https://github.com/dmlc/gluon-nlp/actions/runs/811741483) (cu102-2.0.0b20210502 an…
-
Hi, I am trying to benchmark GPT2-large and ran into: RuntimeError: The size of tensor a (1024) must match the size of tensor b (1025) at non-singleton dimension 3.
The inputs should be able to accep…
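A 1024-vs-1025 mismatch is typically what happens when the context grows one token past GPT-2's 1024 learned positions, so the attention shapes no longer line up. A common workaround is to keep only the most recent tokens; the helper below is a hypothetical sketch of that, not code from the benchmark in question.

```python
# Hypothetical helper: GPT-2's position embeddings cover 1024 positions, so
# feeding a 1025-token context makes attention shapes mismatch (1024 vs 1025).
# Keeping only the last max_positions tokens sidesteps the error.

def trim_to_context_window(token_ids, max_positions=1024):
    """Keep at most the last max_positions tokens (a sliding window)."""
    return token_ids[-max_positions:]

ids = list(range(1030))
trimmed = trim_to_context_window(ids)
# len(trimmed) == 1024; the first 6 tokens were dropped
```

The trade-off is that the model loses the dropped prefix; for benchmarking, capping the prompt length up front avoids the issue entirely.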
-
Helper to reset the session (instead of requiring an app restart)
-
While running run.py for gpt2-medium I am getting the following error:
RuntimeError: [TensorRT-LLM][ERROR] CUDA runtime error in cub::DeviceSegmentedRadixSort::SortPairsDescending(nullptr, cub_temp…
-
**Describe the bug**
The sequence length during training differs from the one specified in the configs. I specified seq-len 50016, which is divisible by tensor-model-parallel-size=4; however, du…
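The divisibility constraint mentioned in the report can be sketched as a small sanity check. This is a hypothetical helper for illustration, not code from the training framework.

```python
# Hypothetical sanity check mirroring the constraint in the report: the
# training sequence length must divide evenly across the tensor-parallel ranks.

def check_divisible(seq_len, tp_size):
    """Return the per-partition length, or raise if seq_len is not divisible."""
    if seq_len % tp_size != 0:
        raise ValueError(f"seq-len {seq_len} is not divisible by tensor-model-parallel-size {tp_size}")
    return seq_len // tp_size

check_divisible(50016, 4)  # 12504 tokens per partition
```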
-
Hi, I am trying out this great framework with a self-trained GPT-2.
I wanted to use a custom-trained model with the base model as the tokenizer.
No matter whether I use this approach or solely the base mo…