-
When using fastertransformer_backend with decoupled mode set to True, the output differs from the output when decoupled is False, and the output length is wrong.
### Branch/Tag/Commit
main
### Docker Image Version
t…
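For reference, decoupled mode means the server streams each generated chunk back as a separate response. Below is a minimal client sketch (not the reporter's setup; the model and tensor names follow the usual fastertransformer_backend convention but are assumptions here) that tallies the streamed output so it can be compared against a non-decoupled run:
```python
# A sketch, assuming a Triton server on localhost:8001 serving a model named
# "fastertransformer"; tensor names/dtypes below are assumptions, not taken
# from the reporter's config.
import queue

import numpy as np
import tritonclient.grpc as grpcclient

chunks = queue.Queue()

def callback(result, error):
    # In decoupled mode the callback fires once per streamed response.
    chunks.put(error if error is not None else result)

client = grpcclient.InferenceServerClient("localhost:8001")

input_ids = np.array([[9915, 27221, 59, 77, 383, 1853]], dtype=np.uint32)
inputs = [
    grpcclient.InferInput("input_ids", list(input_ids.shape), "UINT32"),
    grpcclient.InferInput("input_lengths", [1, 1], "UINT32"),
    grpcclient.InferInput("request_output_len", [1, 1], "UINT32"),
]
inputs[0].set_data_from_numpy(input_ids)
inputs[1].set_data_from_numpy(np.array([[input_ids.shape[1]]], dtype=np.uint32))
inputs[2].set_data_from_numpy(np.array([[32]], dtype=np.uint32))

client.start_stream(callback=callback)
client.async_stream_infer(model_name="fastertransformer", inputs=inputs)
client.stop_stream()  # returns after all streamed responses have arrived

total = 0
while not chunks.empty():
    r = chunks.get()
    if isinstance(r, Exception):
        raise r
    total += r.as_numpy("output_ids").shape[-1]  # "output_ids" is an assumption too
print("output_ids length across stream:", total)
```
If the accumulated stream disagrees with the single non-decoupled response, that localizes the problem to the streaming path rather than the client.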
-
I am getting an error when using `TensorRT-LLM/examples/gptneox/build.py` to build the TensorRT engine:
```
line 314, in build_rank_engine
assert hf_gpt is not None, f'Could not load weights …
```
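The assertion at line 314 fires when build.py fails to load the Hugging Face checkpoint. A quick sanity check, separate from build.py, is to confirm the checkpoint loads with transformers at all (the path below is a placeholder):
```python
# A sanity-check sketch, outside build.py: verify the GPT-NeoX checkpoint
# directory is loadable by transformers before handing it to the engine build.
# The path is a placeholder.
from transformers import AutoModelForCausalLM

hf_gpt = AutoModelForCausalLM.from_pretrained("/path/to/gptneox-checkpoint")
assert hf_gpt is not None, "Could not load weights"
print(type(hf_gpt).__name__)  # expect GPTNeoXForCausalLM
```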
-
When trying to run a Pythia model using gptneox, I got this error. For context, I am using Termux on Android, with Rust installed, to run this model.
$ cargo run --release -- gptneox infer -m pythia-160m-q4_0.bin -…
-
With all the model variants out now (gpt2/gptneox/llama/gptj), I wonder if there's a way to infer a model's type by reading the file?...
Right now, if someone gives me a random model file with ob…
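For what it's worth, the leading magic bytes identify the GGML container variant, but the legacy (pre-GGUF) formats do not record the architecture at all, so gpt2 vs gptneox vs llama vs gptj can only be guessed from the hyperparameter layout that follows. A heuristic sketch of the container check:
```python
# A sketch: identify the GGML container variant from the leading magic bytes.
# The legacy formats do not store the architecture, so beyond the container
# you can only guess from the hyperparameters that follow the magic.
import struct

MAGICS = {
    0x67676D6C: "ggml (unversioned)",
    0x67676D66: "ggmf (versioned)",
    0x67676A74: "ggjt (mmap-able)",
}

def container_format(path: str) -> str:
    with open(path, "rb") as f:
        (magic,) = struct.unpack("<I", f.read(4))
    return MAGICS.get(magic, f"unknown (0x{magic:08x})")

print(container_format("pythia-160m-q4_0.bin"))
```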
-
```
root@5dac227a29e8:~# LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD /usr/local/go/bin/go run /root/go-ggml-transformers.cpp/examples/main.go -m "/models/pythia-70m-q4_0.bin" -t 14
gpt2_model_load: loadi…
```
-
### Branch/Tag/Commit
main
### Docker Image Version
nvcr.io/nvidia/pytorch:22.07-py3
### GPU name
A100
### CUDA Driver
450.156.00
### Reproduced Steps
Follow the steps: Fast…
-
### Branch/Tag/Commit
main
### Docker Image Version
none
### GPU name
T4
### CUDA Driver
525.60.13
### Reproduced Steps
```shell
## Steps
1. Download public GPT-NeoX Model https://huggingfac…
```
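Since the reproduction starts with a Hub download, one way to script that first step in Python is shown below; the repo id is a hypothetical stand-in, as the URL in the report is truncated:
```python
# A sketch of step 1 only: fetch a public GPT-NeoX checkpoint from the
# Hugging Face Hub. The repo id is a hypothetical example, not the one
# from the truncated report.
from huggingface_hub import snapshot_download

local_dir = snapshot_download(repo_id="EleutherAI/gpt-neox-20b")
print("model files in:", local_dir)
```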
-
https://github.com/togethercomputer/redpajama.cpp
https://www.together.xyz/blog/redpajama-models-v1
-
### Branch/Tag/Commit
main
### Docker Image Version
nvcr.io/nvidia/pytorch:22.07-py3
### GPU name
A100
### CUDA Driver
450.156.00
### Reproduced Steps
```shell
1. download …
```
-
https://github.com/triton-inference-server/
- [x] Build Triton Docker image with support for FasterTransformer backend for Fusion etc.
- [x] Convert h2oGPT models to a format that Triton understands h…