-
Obsoletes #147, #150, https://github.com/ggerganov/llama.cpp/issues/1575, https://github.com/ggerganov/llama.cpp/issues/1590, https://github.com/rustformers/llm/discussions/143, and probably some othe…
-
### System Info
```Shell
- `Accelerate` version: 0.20.3
- Platform: Linux-5.15.0-1023-aws-x86_64-with-glibc2.2.5
- Python version: 3.8.11
- Numpy version: 1.24.3
- PyTorch version (GPU?): 2.0.1+c…
-
After fine-tuning LLaMA with LoRA, how do I load the model across multiple GPUs?
Are there any examples?
-
Trying a simple example on an M1 Mac:
```
from ctransformers import AutoModelForCausalLM

llm = AutoModelForCausalLM.from_pretrained(
    "/path/to/starcoderbase-GGML/starcoderbase-ggml-q4_0.bin",
…
-
### 🐛 Describe the bug
I want to deploy the GPT-2 model. I set up the environment on my server (CentOS 7) and then ran the Text Generation example from Hugging Face Transformers, but it fails.
…
-
### Duplicates
- [X] I have searched the existing issues
### Steps to reproduce 🕹
_No response_
### Current behavior 😯
When using Chinese text, the length increases after encoding, which may caus…
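One likely explanation (my assumption, since the report is cut off): byte-level tokenizers operate on UTF-8 bytes, and each CJK character occupies three bytes, so the encoded sequence can be several times longer than the character count. A minimal stdlib-only illustration:

```python
# Byte-level tokenizers see UTF-8 bytes, not characters.
# Each CJK character below encodes to 3 bytes, so length triples.
text = "你好，世界"              # 5 characters
encoded = text.encode("utf-8")   # the byte sequence a byte-level BPE starts from
print(len(text), len(encoded))   # → 5 15
```

The actual token count depends on the tokenizer's merges, but the byte expansion above sets a floor on how much longer Chinese input becomes relative to its character count.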
-
First, thanks for sharing your great research.
I have reviewed the paper and the code, and it appears to be a form of adding a KERPLE bias to the attention score.
However, since the code is in neo…
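For context, the logarithmic variant of the KERPLE bias is a distance-dependent penalty added to the pre-softmax attention scores. A minimal NumPy sketch (the parameter names `r1`, `r2` and the toy shapes are mine, not taken from the repository):

```python
import numpy as np

def kerple_log_bias(seq_len: int, r1: float = 1.0, r2: float = 1.0) -> np.ndarray:
    """Logarithmic-variant KERPLE bias: -r1 * log(1 + r2 * |m - n|)."""
    pos = np.arange(seq_len)
    dist = np.abs(pos[:, None] - pos[None, :])  # pairwise distances |m - n|
    return -r1 * np.log1p(r2 * dist)            # 0 on the diagonal, more negative with distance

def attention_scores(q: np.ndarray, k: np.ndarray, bias: np.ndarray) -> np.ndarray:
    """Scaled dot-product scores with the positional bias added before softmax."""
    d = q.shape[-1]
    return q @ k.T / np.sqrt(d) + bias

rng = np.random.default_rng(0)
q = rng.normal(size=(4, 8))
k = rng.normal(size=(4, 8))
scores = attention_scores(q, k, kerple_log_bias(4))
```

In the paper, `r1` and `r2` are learned per head; here they are fixed constants just to show where the bias enters the score computation.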
-
:/
-
System: M1 Mac
With vanilla CTranslate2 (installed via pip), I was unable to use more than one thread, and I got this warning when trying to increase the thread count: "The number of threads (intra_th…
-
### System Info
```shell
Optimum 1.5.1
Transformers 4.25.1 (training was fine with 4.24.0)
```
### Who can help?
@JingyaHuang
### Information
- [X] The official example scripts…