-
### System Info
```shell
Optimum 1.5.1
Transformers 4.25.1 (training was fine with 4.24.0)
```
### Who can help?
@JingyaHuang
### Information
- [X] The official example scripts…
-
### Feature request
It would be nice to support feature extraction on batched inputs for GPT-style models using `Pipeline`s.
### Motivation
I'm currently trying to generate encodings of a large numbe…
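Roughly what I have in mind: GPT-style tokenizers define no pad token, so batching requires picking a pad id, passing an attention mask, and reading each sequence's embedding from its last *real* token (the models are causal). A minimal offline sketch with a tiny randomly initialized GPT-2 and made-up token ids (a real use case would load a pretrained checkpoint and tokenizer instead):

```python
import torch
from transformers import GPT2Config, GPT2Model

# Tiny random GPT-2 so the sketch runs without downloading weights.
config = GPT2Config(n_layer=2, n_head=2, n_embd=32, vocab_size=100)
model = GPT2Model(config).eval()

# Two "sequences" of different lengths; the shorter one is
# right-padded with a chosen pad id (0 here) and masked out.
input_ids = torch.tensor([[5, 6, 7, 8],
                          [9, 10, 0, 0]])
attention_mask = torch.tensor([[1, 1, 1, 1],
                               [1, 1, 0, 0]])

with torch.no_grad():
    out = model(input_ids=input_ids, attention_mask=attention_mask)

# Per-sequence embedding: hidden state of the last real token,
# not position -1, which may be padding.
last_real = attention_mask.sum(dim=1) - 1
features = out.last_hidden_state[torch.arange(input_ids.size(0)), last_real]
print(features.shape)  # torch.Size([2, 32]) — one n_embd-sized vector per sequence
```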
-
-
-
**Describe the bug**
We are seeing the following error when using the [ONNX Model optimizer](https://github.com/microsoft/onnxruntime/tree/master/onnxruntime/python/tools/transformers#model-optimizer…
-
Hi folks,
I followed the tutorial at https://www.deepspeed.ai/tutorials/inference-tutorial/#end-to-end-gpt-neo-27b-inference and wrote the code below to run gpt2-xl inference.
```
import os
imp…
-
Hi, this is very interesting work. I am curious whether I could use multiple GPUs to train the model, as the original Alpaca did?
-
### System Info
```shell
Optimum: 1.5.1
Python: 3.10.4
Platform: Windows 10
Cuda: 11.6
```
### Who can help?
@JingyaHuang @echarlaix
### Information
- [X] The official example scripts
- [ ] …
-
### 🐛 Describe the bug
My transformers inference script runs successfully on CPU, but when using the MPS device on macOS (M1 Pro), it reports that the 'aten::cumsum.out' op is missing, so I set env…
-
**Describe the bug**
Responses from transformers models are not relevant with long inputs and batch size > 1. This issue affects GPT-like models, while this [issue](https://github.com/microsoft/…