-
I noticed that the results are not reproducible. I am using Llama with BootstrapFewShot, and every time I compile the same program, I get totally different results (not even close).
I noticed in the …
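For context, a minimal sketch of the kind of compile run being described, assuming an Ollama-served Llama and with the sampling temperature pinned to 0 to reduce (though not necessarily eliminate) run-to-run variance; the module, metric, and training examples here are hypothetical:
```python
import dspy
from dspy.teleprompt import BootstrapFewShot

# Hypothetical LM setup: an Ollama-served Llama with temperature 0 so that
# sampling is as deterministic as the backend allows.
lm = dspy.OllamaLocal(model="llama2", temperature=0.0)
dspy.settings.configure(lm=lm)

class QA(dspy.Module):
    def __init__(self):
        super().__init__()
        self.generate = dspy.ChainOfThought("question -> answer")

    def forward(self, question):
        return self.generate(question=question)

def exact_match(example, pred, trace=None):
    # Hypothetical metric: compare normalized answers.
    return example.answer.strip().lower() == pred.answer.strip().lower()

trainset = [
    dspy.Example(question="What is 2 + 2?", answer="4").with_inputs("question"),
    dspy.Example(question="Capital of France?", answer="Paris").with_inputs("question"),
]

teleprompter = BootstrapFewShot(metric=exact_match, max_bootstrapped_demos=2)
# Compiling the same program twice over the same trainset is the step that,
# per the report above, produces very different results each time.
compiled_qa = teleprompter.compile(QA(), trainset=trainset)
```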
-
I plan to implement function calling with vision models such as LLaVA and Nous-Hermes-2-Vision-Alpha, based on the image input, but it seems that the current implementation in the example folder only sup…
-
Hello, and thank you for open-sourcing the CodeShell model. I ran into some problems when trying to run CodeShell-7B-Chat-int4 with TGI, and I would be very grateful if you could help resolve them!
```
docker run --gpus 'all' --shm-size 20g -p 9090:80 -v /root/codeshell/model:/data --env LOG_LEVEL="info,text_g…
```
-
I want to deploy a few open source models with the chat UI. I started a simple model with:
```
model=tiiuae/falcon-7b-instruct
volume=$PWD/data # share a volume with the Docker container to avoid…
```
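Once that container is up, a quick smoke test of the endpoint the chat UI will point at can help separate deployment problems from UI configuration problems; a sketch assuming the container was published on localhost:8080 (adjust to however you mapped the port):
```python
from huggingface_hub import InferenceClient

# Assumes the TGI container above was started with something like `-p 8080:80`,
# so the server is reachable at http://127.0.0.1:8080.
client = InferenceClient("http://127.0.0.1:8080")

# Plain text-generation call against TGI; if this returns text, the same URL
# can be wired into the chat UI's model endpoint configuration.
print(client.text_generation("What is Deep Learning?", max_new_tokens=50))
```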
-
Since HF TGI's [PR](https://github.com/huggingface/text-generation-inference/pull/617) was merged, it should be possible to integrate TGI endpoints into the APIs supported by lm-evaluation-harness.
Any pl…
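For reference, the surface such an integration has to wrap is TGI's plain HTTP generation API; a minimal sketch of calling it directly for a generation-style task (the endpoint URL and parameters are placeholders):
```python
import requests

# Assumed local TGI endpoint; point this at your own deployment.
TGI_URL = "http://127.0.0.1:8080/generate"

def tgi_generate(prompt: str, max_new_tokens: int = 64) -> str:
    # TGI's /generate route takes an "inputs" string plus a "parameters" dict
    # and responds with {"generated_text": "..."}.
    payload = {
        "inputs": prompt,
        "parameters": {"max_new_tokens": max_new_tokens, "do_sample": False},
    }
    resp = requests.post(TGI_URL, json=payload, timeout=60)
    resp.raise_for_status()
    return resp.json()["generated_text"]

print(tgi_generate("The capital of France is"))
```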
-
Greetings, @cipher982!
Currently we are working on the OpenVINO inference framework, and such benchmarks are critical for understanding the gaps and differences between our framework and Transformers/TGI …
-
Hello-
I've been looking into hosting an LLM on AWS infrastructure. I am mainly looking to host Flan T5 XXL. My question is below:
Inquiry: what is the recommended container for hosting Flan T5 X…
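One commonly documented route is SageMaker with the Hugging Face LLM (TGI-based) container; a sketch under those assumptions, with the role, container version, GPU count, and instance type as placeholders to be sized for your account:
```python
import sagemaker
from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

role = sagemaker.get_execution_role()  # or an explicit IAM role ARN

# Resolve the Hugging Face LLM container image for your region.
image_uri = get_huggingface_llm_image_uri("huggingface", version="0.9.3")  # example version

model = HuggingFaceModel(
    role=role,
    image_uri=image_uri,
    env={
        "HF_MODEL_ID": "google/flan-t5-xxl",  # model pulled at container startup
        "SM_NUM_GPUS": "4",                   # shard across the instance's GPUs
        "MAX_INPUT_LENGTH": "1024",
        "MAX_TOTAL_TOKENS": "2048",
    },
)

predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.12xlarge",  # example size; pick what fits your budget
)

print(predictor.predict({"inputs": "Translate to German: Hello, how are you?"}))
```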
-
I brought up ChatQnA UI with all the containers.
### Issue 1. Huggingface download update
The Huggingface TGI container was downloading the model; it took quite a long time, around ~12 min, for Intel/Neural cha…
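One way to take that ~12 min download out of container startup is to pre-fetch the weights into the Hugging Face cache on the host and mount that cache as the TGI container's /data volume; a sketch (the repo id here is only illustrative, since the actual model name is cut off above):
```python
from huggingface_hub import snapshot_download

# Pre-download the model once on the host into a dedicated cache directory.
# Mount this directory to /data in the TGI container so the launcher finds the
# weights in its cache and skips the long download on every start.
snapshot_download(
    repo_id="Intel/neural-chat-7b-v3-1",  # illustrative id; use the model from your setup
    cache_dir="/opt/hf-cache",
)
```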
-
Users may need to set particular TGI(S) parameters when using the Caikit+TGIS runtime on KServe. An example is the model timeout parameter, which may need to be tweaked based on the model size.
…
-
### Feature request
With more FP8-capable instances becoming available on all major platforms, it would be nice if TGI could take advantage of this and start adding FP8-specific features, e.g. `FP8 E…
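For readers, the FP8 variants generally meant by that abbreviation are E4M3 and E5M2; a small sketch of the dynamic-range arithmetic behind them (this is just the standard FP8 format math, not TGI code):
```python
def fp8_max_finite(exp_bits: int, man_bits: int, bias: int, reserves_inf: bool) -> float:
    # Largest finite value: the top usable exponent combined with the largest
    # mantissa that is not reserved for NaN/inf in that format.
    max_exp_field = (2 ** exp_bits - 1) - (1 if reserves_inf else 0)
    # E4M3 reserves only the all-ones mantissa at the top exponent for NaN,
    # so its largest mantissa there is one step below all-ones.
    top_mantissa = (2 ** man_bits - 1 - (0 if reserves_inf else 1)) / 2 ** man_bits
    return 2.0 ** (max_exp_field - bias) * (1 + top_mantissa)

# E4M3: 4 exponent / 3 mantissa bits, bias 7, no infinities -> max 448.0
print("E4M3 max:", fp8_max_finite(4, 3, bias=7, reserves_inf=False))
# E5M2: 5 exponent / 2 mantissa bits, bias 15, IEEE-style inf/NaN -> max 57344.0
print("E5M2 max:", fp8_max_finite(5, 2, bias=15, reserves_inf=True))
```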