-
Hey guys, I am using `GPTQ` to quantize the `GPT-NeoX-20B` model. Previously, when quantizing Llama-family models, I usually used `C4` as the calibration dataset. May I ask which dataset is suitable …
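Since `GPT-NeoX-20B` was trained on The Pile, a slice of The Pile (or `C4`, as with Llama) is a common calibration choice. Whichever corpus is used, GPTQ only needs a few hundred fixed-length token windows. A minimal sketch of sampling such windows, assuming the corpus has already been tokenized into one long list of token IDs (the function name here is hypothetical, not part of any GPTQ library):

```python
import random

def make_calibration_samples(token_ids, n_samples=128, seq_len=2048, seed=0):
    """Sample fixed-length windows of token IDs for GPTQ calibration.

    `token_ids` is one long list of token IDs (e.g. a tokenized slice of
    The Pile or C4); GPTQ typically needs only a few hundred windows.
    """
    rng = random.Random(seed)
    max_start = len(token_ids) - seq_len
    if max_start <= 0:
        raise ValueError("corpus must be longer than seq_len")
    starts = (rng.randrange(max_start) for _ in range(n_samples))
    return [token_ids[s : s + seq_len] for s in starts]

# Toy corpus stands in for real tokenized text.
samples = make_calibration_samples(list(range(10_000)), n_samples=4, seq_len=256)
```

The resulting windows would then be fed to the quantizer as its calibration examples in place of the `C4` default.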
-
**Describe the bug**
Running inference with Deepspeed using GPT-NeoX 20B model produces garbage output, indicating an implementation bug.
**To Reproduce**
For example, this can be seen when using exam…
-
First of all, thank you for the great work.
## System info
autoawq==0.1.8
## Details
While trying to quantize a GPT-NeoX model, I encountered the error below.
```
>>> from awq import AutoAWQForCa…
-
Hey guys
Today I was doing quants of a [new GPTNeoX model called Literature-7B-16384](https://huggingface.co/hakurei/Literature-7B-16384)
I tried making GGMLs through the usual process:
```
py…
-
Hello,
I am interested in volunteering to convert the models from GPT-NeoX format to Hugging Face format.
-
System Info
GPU: NVIDIA RTX 4090
TensorRT-LLM 0.13
Question 1: How can I use the OpenAPI to perform inference on a TensorRT engine model?
root@docker-desktop:/llm/tensorrt-llm-0.13.0/examples/apps# pyt…
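Assuming "OpenAPI" here refers to the OpenAI-compatible server shipped under `examples/apps`, then once that server is running, any OpenAI-style HTTP client can query the TensorRT engine. A standard-library-only sketch of building such a request (the base URL and model name are assumptions — match them to however the server was actually launched):

```python
import json
from urllib import request

# Assumed endpoint of a locally running OpenAI-compatible server.
BASE_URL = "http://localhost:8000/v1/completions"

# Assumed serving name for the engine; adjust to your deployment.
payload = {
    "model": "gpt-neox-20b",
    "prompt": "Hello, my name is",
    "max_tokens": 32,
    "temperature": 0.8,
}

req = request.Request(
    BASE_URL,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
# resp = request.urlopen(req)  # uncomment once the server is up
# print(json.load(resp))
```

The same payload shape works with any OpenAI-compatible client library by pointing its base URL at the local server.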
-
My server cannot connect to the Hugging Face website, so I manually downloaded the pretrained model used in the code and placed it in the `img2img-turbo-main` folder. After executing the command `pyth…
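One way to keep the Hugging Face libraries from reaching out to the Hub at all is to set their offline environment variables and pass the local folder to whatever `from_pretrained` call the script makes. A sketch, assuming the script loads weights through a Hugging Face-style `from_pretrained` (the checkpoint path and `SomeModel` name are placeholders, not the project's real API):

```python
import os

# Real environment variables understood by huggingface_hub / transformers:
# with these set, any attempted download fails fast instead of hanging.
os.environ["HF_HUB_OFFLINE"] = "1"
os.environ["TRANSFORMERS_OFFLINE"] = "1"

# Placeholder: point this at the folder where you placed the weights.
local_checkpoint = "img2img-turbo-main/checkpoints"

# Then make the script's loading call take the local path, e.g.:
# model = SomeModel.from_pretrained(local_checkpoint, local_files_only=True)
```

`local_files_only=True` is a real `from_pretrained` keyword that forces loading from disk even when the environment variables are not set.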
-
### System Info
Optimum Habana: 1.10.4
Synapse: 1.14.0
Dockerfile:
```
FROM vault.habana.ai/gaudi-docker/1.14.0/ubuntu22.04/habanalabs/pytorch-installer-2.1.1:latest
# Installs pdsh and upg…
-
**Describe the bug**
We conducted tests on OPT/GPT-J/GPT-NeoX/BLOOM 7B INT8; all of these models produce garbage outputs on DeepSpeed 0.8.1.
The OPT model has an NCCL communication issue.
GPT-…
-
### Feature request
I encountered a KeyError while loading the phi3-v vision model with Hugging Face Optimum. The error message states:
```
KeyError: 'phi3-v model type is not supported yet in Nor…