-
Hi - really interesting work. We're currently using HF TGI in production and exploring using this instead. Are there plans to add sampling parameters like typical_p, which transformers supports? Would greatly ease t…
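For context, typical (locally typical) sampling keeps the tokens whose surprisal is closest to the distribution's entropy until their cumulative mass reaches `typical_p`. A minimal NumPy sketch of that filtering step, purely for illustration (the function name and shapes here are made up, not TGI's or transformers' API):

```python
import numpy as np

def typical_filter(logits, typical_p=0.9):
    """Mask logits to the smallest set of 'typical' tokens.

    Tokens are ranked by how close their surprisal (-log p) is to the
    entropy of the full distribution; we keep the closest ones until
    their cumulative probability reaches typical_p.
    """
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    # Entropy of the full next-token distribution.
    entropy = -(probs * np.log(probs + 1e-12)).sum()
    # Distance of each token's surprisal from that entropy.
    shift = np.abs(-np.log(probs + 1e-12) - entropy)
    order = np.argsort(shift)
    cum = np.cumsum(probs[order])
    # Smallest prefix of "typical" tokens covering typical_p mass.
    cutoff = np.searchsorted(cum, typical_p) + 1
    keep = order[:cutoff]
    masked = np.full_like(logits, -np.inf)
    masked[keep] = logits[keep]
    return masked
```

The masked logits would then be softmaxed and sampled as usual; a near-uniform distribution keeps most tokens, a sharply peaked one keeps only the dominant token.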
-
I have encountered the problem mentioned in the title. Could someone help me understand what is going on and how to resolve it?
Any assistance would be greatly appreciated.
-
At this point the directory `tgis` has not yet been copied:
https://github.com/mundialis/actinia_core/blob/e1510fce08f8d40d5fc048dc7fd81e920ed5ffc3/src/actinia_core/resources/persistent_processing.py#L477
I…
-
**Is your feature request related to a problem? Please describe.**
Currently we are hosting Open Source Models like Mixtral-8x7B with the Hugging Face Inference Endpoint. With the new tgi 1.4 Version…
-
Hi there :-)
Is there a way to configure multiple users / concurrent request sessions?
I'd like to simulate how the different backends behave with not just 1 user, but e.g. 8 users concurrently a…
-
models can still be yanked, but this should reduce the variability
Here's an example, which uses tgi:
```py
def download_model():
subprocess.run(
[
"text-generation-s…
-
我是用TGI加载本地模型CodeShell-7B-Chat,但是加载过程中报错,我使用的命令如下:
```sh
sudo docker run --gpus 'all' --shm-size 1g -p 9090:80 -v /home/CodeShell/WisdomShell:/data --env LOG_LEVEL="info,text_generation_router=debug"…
-
Building upon the current plotting / textmode enhancements and its isolated yet modular-packaging, consider a "[minimalist](https://github.com/picocomputer/ehbasic-plus/issues/1#issuecomment-189084126…
-
Using TGI or Lorax eetq quantization takes several minutes (Eg 10 minutes for Mixtral) every time the launcher is run .
As a reference bitsandbytes nf4 quant takes 1 minute.
Is there any way to …
-
Hello,
We are using latest main TensorRT LLM and container build with TensorRT-Backend to run Mixtral. Generation doesn't stop and goes until max_tokens is reached. Passing "end_id": 2 doesn't help.
…