-
Hi
I tried the 13B version in TGI, and it works fine with bitsandbytes quantization.
However, when trying AWQ quantization in TGI, it fails with the error "Cannot load 'awq' weight, make sure the model is al…
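For context, a minimal sketch of the difference between the two modes (model IDs and the volume path are illustrative; this assumes the standard TGI Docker image): `--quantize bitsandbytes` quantizes a full-precision checkpoint on the fly, while `--quantize awq` expects a checkpoint that was already quantized with AWQ, which is what the error is pointing at.

```shell
# On-the-fly quantization of a full-precision checkpoint (the case that works above)
docker run --gpus all --shm-size 1g -p 8080:80 -v $PWD/data:/data \
  ghcr.io/huggingface/text-generation-inference:latest \
  --model-id meta-llama/Llama-2-13b-chat-hf --quantize bitsandbytes

# AWQ mode loads pre-quantized AWQ weights (e.g. a *-AWQ checkpoint);
# pointing it at full-precision weights triggers the "Cannot load 'awq' weight" error
docker run --gpus all --shm-size 1g -p 8080:80 -v $PWD/data:/data \
  ghcr.io/huggingface/text-generation-inference:latest \
  --model-id TheBloke/Llama-2-13B-chat-AWQ --quantize awq
```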
-
I recently tried to build TGI 2.0.1 again but encountered a new error:
```
Installed /server9/cbj/programming/anaconda3/envs/tgi_server/lib/python3.11/site-packages/typer-0.12.3-py3.11.egg
error: h11 0.…
```
-
We need sample code and a tutorial for running LLaMa 2 with TGI.
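As a starting point, here is a minimal sketch of querying a running TGI server from Python (it assumes TGI is already serving a LLaMa 2 model on localhost:8080; the `/generate` route and the `inputs`/`parameters` payload shape follow TGI's REST API):

```python
import json
import urllib.request

def build_payload(prompt, max_new_tokens=64, temperature=0.7):
    # TGI's /generate route takes the prompt under "inputs" and
    # sampling options under "parameters"
    return {
        "inputs": prompt,
        "parameters": {
            "max_new_tokens": max_new_tokens,
            "temperature": temperature,
        },
    }

def generate(prompt, url="http://localhost:8080/generate", **params):
    # POST the JSON payload and return the generated text
    data = json.dumps(build_payload(prompt, **params)).encode()
    req = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["generated_text"]
```

With a server running, `generate("What is deep learning?", max_new_tokens=32)` returns the completion string.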
-
Speculative sampling is a technique for improving the throughput of LLMs, and customers have requested that it be supported on Inf2.
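As background, a toy sketch of the idea (the greedy lambda "models" below are illustrative stand-ins for real draft/target LMs): a cheap draft model proposes a block of tokens, the target model verifies them and keeps only the longest matching prefix, so several tokens can be accepted per target pass without changing the target's output.

```python
def speculative_decode(target, draft, prompt, k=4, max_tokens=8):
    """target/draft: fn(seq) -> next token (greedy stand-ins for real LMs)."""
    seq = list(prompt)
    while len(seq) - len(prompt) < max_tokens:
        # 1. the cheap draft model proposes k tokens autoregressively
        ctx = seq[:]
        proposal = []
        for _ in range(k):
            t = draft(ctx)
            proposal.append(t)
            ctx.append(t)
        # 2. the target model verifies; keep the longest matching prefix
        ctx = seq[:]
        for t in proposal:
            if target(ctx) != t:
                break
            ctx.append(t)
        seq = ctx
        # 3. the target always contributes one token (correction or bonus),
        #    so at least one token is generated per target pass
        seq.append(target(seq))
    return seq[len(prompt):][:max_tokens]

# toy stand-ins: the target counts up by 1; the draft is wrong after multiples of 3
target = lambda s: s[-1] + 1
draft = lambda s: s[-1] + 2 if s[-1] % 3 == 0 else s[-1] + 1

# the output is identical to decoding with the target alone
print(speculative_decode(target, draft, [0]))  # → [1, 2, 3, 4, 5, 6, 7, 8]
```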
-
### System Info
TGI docker image on GCP.
GPU: A100
Model: Phi-3
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An officially supported command
- [ ] My own modifications…
-
### Model description
https://github.com/huggingface/text-generation-inference/pull/1709
Since TGI has added LLaVA support, I would like to know if there is any timeline for the LLaVA support o…
-
To allow multi-tenant inference:
- [ ] Explore vLLM/HuggingFace TGI
- [ ] Fallback implement baseline FastAPI with batch processing
-
- the model should be loaded
- more?
@Xaenalt
-
Hi, this is the INC team from Intel. Thank you for developing this amazing project.
### Motivation
Our team has developed a new weight-only quantization algorithm called Auto-Round. It has achie…
-
I get "text_generation_launcher: Method Warmup encountered an error." at the final stage:
```
2024-03-30T14:16:55.598106565Z 2024-03-30T14:16:55.597709Z ERROR warmup{max_input_length=3000 max_prefil…
```