-
Hi there :-)
Is there a way to configure multiple users / concurrent request sessions?
I'd like to simulate how the different backends behave when there is not just 1 user but e.g. 8 users concurrently a…
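In the meantime, concurrency can be simulated from the client side. A minimal sketch, assuming a TGI-style `/generate` endpoint; the URL and payload are placeholders:

```python
import concurrent.futures

import requests

URL = "http://localhost:8080/generate"  # placeholder endpoint
PAYLOAD = {"inputs": "Hello", "parameters": {"max_new_tokens": 32}}

def one_user(i: int) -> float:
    # One simulated user: send a request, return its latency in seconds.
    r = requests.post(URL, json=PAYLOAD, timeout=120)
    r.raise_for_status()
    return r.elapsed.total_seconds()

# 8 users firing concurrently instead of 1.
with concurrent.futures.ThreadPoolExecutor(max_workers=8) as pool:
    latencies = list(pool.map(one_user, range(8)))

print(f"avg latency over 8 concurrent users: {sum(latencies) / len(latencies):.2f}s")
```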
-
**Is your feature request related to a problem? Please describe.**
Modules that process spatio-temporal data often use date and time input. It would be useful to have some standard parser options for…
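To make the ask concrete, a minimal sketch of the kind of parsing such an option could standardize, using Python's stdlib `datetime` with a `dateutil` fallback; `parse_timestamp` is a hypothetical helper, not an existing API:

```python
from datetime import datetime

from dateutil import parser as du_parser  # pip install python-dateutil

def parse_timestamp(value: str) -> datetime:
    """Hypothetical helper: try strict ISO 8601 first, then fuzzy parsing."""
    try:
        return datetime.fromisoformat(value)
    except ValueError:
        return du_parser.parse(value)

print(parse_timestamp("2021-06-01T12:00:00"))   # ISO 8601
print(parse_timestamp("June 1st 2021, 12:00"))  # free-form fallback
```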
-
I have two 2070 Supers and would love to be able to use them in parallel. Would it be possible to enable memory pooling? I know it is in theory supported by PyTorch. Any chance it can be added here so that…
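Not specific to this project, but for reference, a minimal sketch of pooling both cards' memory with transformers' `device_map="auto"` (requires the accelerate package); the model id is a placeholder:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "bigscience/bloom-3b"  # placeholder model id

# device_map="auto" shards the weights across every visible GPU,
# pooling the memory of both cards instead of using just one.
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID, device_map="auto", torch_dtype=torch.float16
)
tok = AutoTokenizer.from_pretrained(MODEL_ID)

inputs = tok("Hello", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=16)
print(tok.decode(out[0], skip_special_tokens=True))
```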
-
Based on watsonx requirements, we should make available at least these metrics (a sketch of exposing them follows the list):
- # of inference requests over a defined time period
- Avg. response time over a defined time period
- # of successf…
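A minimal sketch of exposing such metrics with the standard `prometheus_client` package, assuming Prometheus handles the time-window aggregation; the metric and function names here are illustrative, not a confirmed schema:

```python
from prometheus_client import Counter, Histogram, start_http_server

# Illustrative metric names, not a confirmed schema.
REQUESTS = Counter("inference_requests_total", "Inference requests received")
SUCCESSES = Counter("inference_success_total", "Inference requests that succeeded")
LATENCY = Histogram("inference_response_seconds", "Response time in seconds")

def handle(run_inference, payload):
    REQUESTS.inc()
    with LATENCY.time():  # observes the response time of each request
        result = run_inference(payload)
    SUCCESSES.inc()
    return result

# Prometheus scrapes this port; rate() over the counters and histogram
# then yields counts and average response time over any time window.
start_http_server(9090)
```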
-
I'm trying to deploy Llama3 8B on GKE using optimum but I'm running into some trouble.
I'm following the instructions here: https://github.com/huggingface/optimum-tpu/tree/main/text-generation-inference. I bu…
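Once the pod is up, a quick smoke test against TGI's `/generate` route can help separate deployment problems from server problems. A minimal sketch, assuming the service is port-forwarded to localhost:8080:

```python
import requests

resp = requests.post(
    "http://localhost:8080/generate",  # assumes kubectl port-forward to the service
    json={"inputs": "What is Kubernetes?", "parameters": {"max_new_tokens": 32}},
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["generated_text"])
```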
-
Request
The ask is to introduce an OpenAI text generation API compatibility layer (chat completions endpoint) in kserve/TGIS.
Why
Having an OpenAI API compatibility layer will allow more open sourc…
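For concreteness, the request shape such a layer would accept is the standard OpenAI chat completions payload. A minimal sketch against a hypothetical kserve/TGIS host; the host and model name are placeholders:

```python
import requests

resp = requests.post(
    "http://tgis.example.com/v1/chat/completions",  # hypothetical host
    json={
        "model": "llama-2-7b-chat",  # placeholder model name
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Hello!"},
        ],
        "max_tokens": 64,
    },
    timeout=120,
)
print(resp.json()["choices"][0]["message"]["content"])
```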
-
Hi guys, love your project!
I was wondering if you could add support for Mistral via one of the following (a short vLLM sketch follows the list):
- [TGI](https://github.com/huggingface/text-generation-inference)
- [vllm](https://github.com/vllm-project/vllm)…
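For reference, a minimal sketch of what serving Mistral through vLLM's offline Python API looks like, assuming a vLLM version with Mistral support; the model id is as published on the HF Hub:

```python
from vllm import LLM, SamplingParams

llm = LLM(model="mistralai/Mistral-7B-Instruct-v0.1")  # pulls from the HF Hub
params = SamplingParams(temperature=0.7, max_tokens=64)

outputs = llm.generate(["Explain mixture-of-experts in one sentence."], params)
print(outputs[0].outputs[0].text)
```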
-
I am trying to run TGI on Docker using 8 GPUs with 16 GB each, using the following command:
```
docker run --gpus all --name tgi --shm-size 1g --cpus="5.0" --rm --runtime=nvidia -e HUGGING_FACE_HUB_TOKEN=…
```
-
Background:
TGI is being adapted to lightllm. When loading a model across multiple GPUs, one process is spawned per GPU used, and each process loads the entire model into host memory.
When the model files are very large, e.g. models of 65B or above, loading on 8 GPUs requires 8 × 130 GB of memory, which is clearly unreasonable and leads to OOM.
Proposed solution:
lightllm could provide a load_from_weight_dict(weight_dict) interface. The TGI layer would pass in a weight dict…
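A minimal sketch of what such an interface could look like, paired with safetensors' lazy `safe_open` so each rank only keeps the tensors for its own shard; the body of `load_from_weight_dict` and the dim-0 sharding rule are hypothetical:

```python
import torch
from safetensors import safe_open

def load_from_weight_dict(model: torch.nn.Module, weight_dict: dict) -> None:
    """Hypothetical lightllm hook: consume a caller-provided weight dict."""
    model.load_state_dict(weight_dict, strict=False)

def shard_for_rank(path: str, rank: int, world_size: int) -> dict:
    # safe_open memory-maps the checkpoint; tensors are only materialized
    # when get_tensor() is called, so no rank holds the full state dict.
    shard = {}
    with safe_open(path, framework="pt", device="cpu") as f:
        for name in f.keys():
            # Hypothetical rule: split every weight along dim 0.
            shard[name] = f.get_tensor(name).chunk(world_size, dim=0)[rank]
    return shard
```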
-
Hello,
I have a Space that worked until yesterday; it was created with the standard chat-ui Docker image. Since today, whenever it is built, I get the following error:
```
--> RUN npm run build
…
```