-
The model used for ChatQnA supports BFLOAT16 in addition to TGI's default 32-bit float type: https://huggingface.co/Intel/neural-chat-7b-v3-3
With BFLOAT16, TGI memory usage halves from 30GB to 15GB (and also it…
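For reference, a minimal launch sketch of how the dtype could be switched; `--dtype bfloat16` is a standard TGI launcher flag, while the port and volume paths here are placeholder assumptions:
```
# Sketch of a TGI launch with bfloat16; port/volume are placeholders.
model=Intel/neural-chat-7b-v3-3
volume=$PWD/data   # model cache so weights are not re-downloaded

docker run --rm -p 8080:80 -v $volume:/data \
    ghcr.io/huggingface/text-generation-inference:2.0 \
    --model-id $model \
    --dtype bfloat16
```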
-
Please update the TGI image from 1.4 to 2.0 in all TGI README files.
I faced issues with the Phi-3 model.
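For example, the image reference in the launch commands would change like this (an illustrative sketch, not an exact line from the READMEs):
```
# Old tag referenced in the READMEs:
docker pull ghcr.io/huggingface/text-generation-inference:1.4

# Proposed update:
docker pull ghcr.io/huggingface/text-generation-inference:2.0
```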
-
### System Info
A100
ghcr.io/huggingface/text-generation-inference:2.1.0
### Who can help?
@1049451037
Deploying with TGI, I hit this error:
ValueError: Unsupported model type cogvlm2
The image used is: gh…
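For a concrete reproduction, a launch along these lines triggers the error; the model id is my assumption, since the original image/model line is truncated:
```
# Hypothetical reproduction; model id assumed (original is truncated).
docker run --rm --gpus all -p 8080:80 \
    ghcr.io/huggingface/text-generation-inference:2.1.0 \
    --model-id THUDM/cogvlm2-llama3-chat-19B
# -> ValueError: Unsupported model type cogvlm2
```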
-
Langserve is used in:
```
$ git grep -i langserve
comps/llms/summarization/tgi/llm.py:from langserve.serialization import WellKnownLCSerializer
comps/llms/summarization/tgi/requirements.txt:langse…
```
-
### System Info
Text-generation-inference: v2.1.0+
Driver Version: 535.161.08  CUDA Version: 12.2
GPU: DGX with 8xH100 80GB
### Information
- [x] Docker
- [ ] The CLI directly
### Tasks
- [x…
-
When manually launching my fine-tune of Idefics2, Hugging Face TGI says `Unsupported model type idefics2`. How did you get the Idefics2 TGI to run on RunPod?
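For comparison, the base model is expected to load on recent 2.x images with a plain launch like the sketch below; the tag and flags are assumptions, not a confirmed RunPod setup:
```
# Sketch of a plain TGI launch for the base Idefics2 model;
# image tag/flags are assumptions.
docker run --rm --gpus all -p 8080:80 \
    ghcr.io/huggingface/text-generation-inference:2.1.0 \
    --model-id HuggingFaceM4/idefics2-8b
```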
-
### Feature request
Apologies if this should be elsewhere, but I'm curious whether you plan on adding support for ONNX models like https://huggingface.co/microsoft/Phi-3-mini-128k-instruct-onnx
### M…
-
1. Are there plans for inference support? This is needed if it's to be used by devs in production.
2. Is fine-tuning much faster than LoRA?
- Optimization and the backward pass are MUCH faster, but sure…
-
Related to #258: why are services using the `hostIPC` option [1]?
```
$ git grep hostIPC
ChatQnA/kubernetes/manifests/chaqna-xeon-backend-server.yaml: hostIPC: true
ChatQnA/kubernetes/manifests/e…
```
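For background on what the flag does, the field documentation can be pulled straight from a cluster with standard kubectl (nothing repo-specific):
```
# Print Kubernetes' own documentation for the hostIPC pod field:
# it makes the pod share the host's IPC namespace (default: false),
# which is security-sensitive.
kubectl explain pod.spec.hostIPC
```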
-
Add integration for the [TGI](https://github.com/huggingface/text-generation-inference) LLM provider.
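A sketch of the request such an integration would wrap; `/generate` and its JSON shape are TGI's documented REST API, while the host and port are placeholders:
```
# Minimal TGI generate request; host/port are placeholders.
curl http://localhost:8080/generate \
    -X POST \
    -H 'Content-Type: application/json' \
    -d '{"inputs": "What is deep learning?", "parameters": {"max_new_tokens": 64}}'
```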