-
I am interested in using the Mamba2 model with the `transformers` library. However, I've encountered several issues and have some questions:
1. **Model Accessibility:** It seems the Mamba2 model is…
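For reference, here is a minimal loading sketch, assuming a `transformers` release that ships Mamba2 support and a checkpoint already converted to the `transformers` format (the model id below is a placeholder, not a confirmed repository):
```
# Minimal sketch, assuming Mamba2 support is available in the installed transformers version.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "org/mamba2-checkpoint"  # placeholder id; substitute a real Mamba2 repo
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

inputs = tokenizer("Hello, Mamba2!", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```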
-
Fuse some popular functions and automatically replace modules in an existing 🤗 transformers model with their corresponding fusion modules.
**APIs**
```
from pipegoose.nn import fusion
# and ot…
```
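As an illustration of the idea (not pipegoose's actual API), a module-replacement pass over a 🤗 transformers model could look like the following sketch, which swaps every `nn.LayerNorm` for a hypothetical fused drop-in:
```
# Illustrative sketch only: a generic module-replacement pass in plain PyTorch,
# not pipegoose's actual fusion API.
import torch.nn as nn
from transformers import AutoModel

class FusedLayerNorm(nn.LayerNorm):
    """Hypothetical stand-in for a fused LayerNorm kernel; same math as nn.LayerNorm."""
    pass

def to_fused(ln: nn.LayerNorm) -> FusedLayerNorm:
    # Build the replacement with the same shape/eps and copy the affine parameters over.
    fused = FusedLayerNorm(ln.normalized_shape, eps=ln.eps, elementwise_affine=ln.elementwise_affine)
    fused.load_state_dict(ln.state_dict())
    return fused

def replace_modules(module: nn.Module, target: type, convert) -> nn.Module:
    # Walk the module tree and swap every `target` instance for its converted version.
    for name, child in module.named_children():
        if isinstance(child, target):
            setattr(module, name, convert(child))
        else:
            replace_modules(child, target, convert)
    return module

model = AutoModel.from_pretrained("bert-base-uncased")
model = replace_modules(model, nn.LayerNorm, to_fused)
```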
-
```
/usr/local/cuda-11.1/bin/nvcc -I/home/hugh/anaconda3/envs/gptserv/lib/python3.9/site-packages/torch/include -I/home/hugh/anaconda3/envs/gptserv/lib/python3.9/site-packages/torch/include/torch/c…
```
-
Currently, multi-LoRA supports only Llama and Mistral architectures. We should extend this functionality to all architectures.
Yi, Qwen, Phi and Mixtral architectures seem to be the most demanded r…
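For context, here is a rough sketch of what multi-LoRA looks like at the modeling level with 🤗 PEFT on a non-Llama base model; the adapter ids are hypothetical, and this is not this project's serving API:
```
# Illustration only: hypothetical adapter ids, using 🤗 PEFT rather than this
# project's multi-LoRA implementation.
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen1.5-7B")                # Qwen base model
model = PeftModel.from_pretrained(base, "user/adapter-a", adapter_name="a")   # first LoRA
model.load_adapter("user/adapter-b", adapter_name="b")                        # second LoRA
model.set_adapter("b")  # switch the active adapter between requests
```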
-
Hello,
It seems that currently int8 weight-only and SmoothQuant quantization are supported for GPT models, but no quantization is supported for other autoregressive transformer models, suc…
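For background, SmoothQuant migrates activation outliers into the weights with a per-channel scale before int8 quantization; a minimal NumPy sketch of that scaling step (framework-agnostic, not tied to any particular backend) is:
```
# Minimal sketch of the SmoothQuant scaling step (per-channel scale migration).
import numpy as np

def smooth_scales(activations, weight, alpha=0.5):
    # activations: (tokens, in_features); weight: (in_features, out_features)
    act_max = np.abs(activations).max(axis=0)            # per-input-channel activation range
    w_max = np.abs(weight).max(axis=1)                   # per-input-channel weight range
    return (act_max ** alpha) / (w_max ** (1 - alpha))   # s_j = max|X_j|^a / max|W_j|^(1-a)

X = np.random.randn(64, 128).astype(np.float32)
W = np.random.randn(128, 256).astype(np.float32)
s = smooth_scales(X, W)
# (X / s) @ (s * W) equals X @ W, but X / s has a flatter range and is easier to quantize.
assert np.allclose((X / s) @ (W * s[:, None]), X @ W, atol=1e-3)
```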
-
When I tried
```
!python qlora.py --learning_rate 0.0001 --model_name_or_path EleutherAI/gpt-neox-20b --trust_remote_code
```
in Colab, I got the following errors:
```
2023-06-03 13:54:17.113623: W t…
-
### Description
```shell
Model: Gpt-NeoX
GPU: A100
Tritonserver version: 22.12
```
Hello, I'm not sure whether this is a FasterTransformer issue or a backend issue, but I'm still reporting i…
-
File "/UNICOMFS/hitsz_mzhang_1/.conda/envs/quantize/lib/python3.9/site-packages/transformers/models/gpt_neox/modeling_gpt_neox.py", line 155, in forward
qkv = self.query_key_value(hidden_states…
-
Hello, looking at your batch_view.py, I found that the data is not shuffled, whereas in the gpt-neox library the data is shuffled.
So I want to confirm whether or not the author shuffled the data during t…
-
SparseAttention relies on Triton for specific kernels. GPT-NeoX currently pins `triton==0.4.2` as a dependency, which is behind the `1.0.0` version that DeepSpeed uses. It is far behind the version of Triton …