-
### Feature request
I tried to run Llama-3 on TGI (1.3). The model kind of works, but it doesn't stop at the EOS tokens. I suspect TGI doesn't "understand" Llama-3's new tokenization scheme and promp…
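As a hedged workaround sketch (the right fix depends on the TGI version), Llama-3's end-of-turn token `<|eot_id|>` can be passed as an explicit stop sequence via TGI's `stop` generate parameter; the endpoint URL below is a placeholder, not part of the original report:

```python
import json

# Llama-3 ends assistant turns with <|eot_id|>; older TGI builds may not
# treat it as an EOS token, so we pass it as an explicit stop sequence.
# The URL is a placeholder for your own TGI endpoint.
TGI_URL = "http://localhost:8080/generate"

payload = {
    "inputs": (
        "<|begin_of_text|><|start_header_id|>user<|end_header_id|>\n\n"
        "Hello!<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n"
    ),
    "parameters": {
        "max_new_tokens": 256,
        "stop": ["<|eot_id|>"],  # halt generation at the turn marker
    },
}

body = json.dumps(payload)
# e.g. requests.post(TGI_URL, data=body,
#                    headers={"Content-Type": "application/json"})
print(body)
```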
-
### Feature request
Currently TGI NeuronX loads the artifacts with the NeuronModelForCausalLM class, which raises an error when loading Flan-T5.
```
Unrecognized configuration class for this kind of Aut…
-
Ability to export TGIs as CSV and/or SQLite DB for individual analysis.
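A minimal sketch of what such an export could look like, assuming the analysis records are available as plain dicts; the field names (`id`, `prompt`, `tokens`, `latency_ms`) and table name are invented for illustration:

```python
import csv
import sqlite3

# Hypothetical records; real TGI analysis data would replace these.
rows = [
    {"id": 1, "prompt": "hello", "tokens": 12, "latency_ms": 85.0},
    {"id": 2, "prompt": "bye", "tokens": 7, "latency_ms": 41.5},
]

def export_csv(path, records):
    """Write records to a CSV file with a header row."""
    with open(path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=list(records[0]))
        writer.writeheader()
        writer.writerows(records)

def export_sqlite(path, records):
    """Write records to a SQLite table for ad-hoc SQL analysis."""
    con = sqlite3.connect(path)
    con.execute(
        "CREATE TABLE IF NOT EXISTS generations "
        "(id INTEGER PRIMARY KEY, prompt TEXT, tokens INTEGER, latency_ms REAL)"
    )
    con.executemany(
        "INSERT OR REPLACE INTO generations "
        "VALUES (:id, :prompt, :tokens, :latency_ms)",
        records,
    )
    con.commit()
    con.close()

export_csv("generations.csv", rows)
export_sqlite("generations.db", rows)
```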
-
**Acceptance criteria:**
- [x] Architecture diagram for Caikit/TGIS and ODH/RHODS
- [X] ADR
-
### System Info
ghcr.io/huggingface/text-generation-inference:2.0.4 & 2.1.0
Ubuntu 22.04 server, 8xA6000.
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An officially suppor…
-
### Feature request
Hello, our models are deployed with TGI (v1.4.3), and we also want to use LoRAX. But I find that the TGI version LoRAX is based on is very different from TGI v1.4.3.
We …
-
I have updated my h2oGPT Docker version and now my Docker won't start.
The error thrown is: Malformed inference server.
It is a bug because a few lines earlier in the command line it succeeded in generat…
-
I have a fine-tuned Llama 2 7B chat model which I am deploying to an endpoint using a DJL container. After deploying, when I tested the model, the model output quality had degraded (The output seems to be…
-
### System Info
Tests run via dedicated endpoints and Idefics2.
TGI version was probably 2.0.2
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An officially suppo…
-
For 4096 tokens (which is forced by Omost), using a Llama-3 model on a 4090, it takes 120 s to complete the prompt, while SD takes only 7 s. It's a big gap.
How can we accelerate the local GPT?
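To put a number on the gap, a quick back-of-the-envelope throughput calculation using the figures from the report (4096 tokens, 120 s):

```python
# Rough decode throughput implied by the reported numbers.
tokens = 4096
seconds = 120.0
tok_per_s = tokens / seconds
print(f"{tok_per_s:.1f} tokens/s")  # ~34.1 tokens/s
```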