gpt2-inference-performance Search Results

236 results for gpt2-inference-performance

NVIDIA/Megatron-LM #1134

[BUG] 'NoneType' object has no attribute 'shape' error raise…

Hi, it seems that the same code works fine with the Megatron-LM that I git-cloned in April. With the latest Megatron-LM, I get the following error raised by the pretrain_gpt.py code. …

hwang2006 updated 1 week ago
8
dzagardo/forgetnet #1

Any easy way to get this running for images? Maybe a small e…

aymuos15 updated 2 months ago
9
run-llama/llama_index #15730

[Question]: Can I use query engine with batch?

### Question Validation - [X] I have searched both the documentation and discord for an answer. ### Question Hi, thank you for your nice work. I left the question to ask the availability of batch g…

MINJIK01 updated 1 month ago
7
mindspore-lab/mindnlp #990

GPT2 inference performance need to be boosted, if we want to…

**Is your feature request related to a problem? Please describe.** When using mindnlp to run GPT2 inference, I found that the inference speed is 10X slower than PyTorch. Here is the torch version implementa…

WilliamLiuAtCPC updated 2 months ago
1
microsoft/onnxruntime #13559

[Feature Request] TensorRT custom engine Plans

### Describe the feature request It would be great to have the option to provide pre-optimised TensorRT engine plans to ORT. ### Describe scenario use case Using TensorRT in standalone, e.g. trtex…

contentis updated 4 months ago
12
irthomasthomas/undecidability #640

README.md · defog/sqlcoder-7b-2 at main

- [ ] [README.md · defog/sqlcoder-7b-2 at main](https://huggingface.co/defog/sqlcoder-7b-2/blob/main/README.md?code=true) # README.md · defog/sqlcoder-7b-2 at main **DESCRIPTION:** ```yaml license:…

irthomasthomas updated 7 months ago
1
irthomasthomas/undecidability #625

unsloth/README.md at main · unslothai/unsloth

- [ ] [unsloth/README.md at main · unslothai/unsloth](https://github.com/unslothai/unsloth/blob/main/README.md?plain=1) # unsloth/README.md at main · unslothai/unsloth …

irthomasthomas updated 7 months ago
1
waifuoid/llmlingua-api #1

Need client demo

The code is concise and very helpful. Please provide a demo for the client.

huyinguo updated 7 months ago
1
microsoft/onnxruntime #15191

[Performance] GPT NEO: better performance of python GPT NEO …

### Describe the issue I implemented a program with GPT NEO in python (attached the program) and the equivalent version in C++. To acquire the exported GPT NEO model I made some slight modification…

Zapotecatl updated 1 year ago
3
microsoft/Olive #1075

Olive workflow for mistral model optimization does not work

**Describe the bug** Following the instructions in [`examples/mistral`](https://github.com/microsoft/Olive/tree/main/examples/mistral) does not result in a quantized onnx model. After running the wor…

jojo1899 updated 3 months ago
17
