-
### Priority
P1-Stopper
### OS type
Ubuntu
### Hardware type
Xeon-GNR
### Installation method
- [X] Pull docker images from hub.docker.com
- [ ] Build docker images from source
…
-
**Describe the bug**
When changing the mapset inside a Python script, the temporal framework does not take this change into account and fails to connect to the temporal database, even though GRASS re…
-
Hi - really interesting work. We're currently using HF TGI in production and exploring using this instead. Are there plans to add things like typical_p, which transformers supports? Would greatly ease t…
-
### System Info
GPU: RTX4090
Run 2.1.0 with docker like:
`docker run -it --rm --gpus all --ipc=host -p 8080:80 -v /home/jp/.cache/data:/data ghcr.io/huggingface/text-generation-inference:2.1.0 …
jphme updated
1 month ago
-
Do you have streaming functionality for auto-regressive LLMs? Something similar to Huggingface TGI for example.
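To illustrate what token streaming means here: instead of returning the whole completion at once, the server yields each token as it is decoded, and the client reassembles the stream. Below is a minimal, hypothetical sketch; `fake_generate_tokens` stands in for a real model's decode loop and is not part of any actual library API.

```python
from typing import Iterator

def fake_generate_tokens(prompt: str) -> Iterator[str]:
    # Placeholder for a real auto-regressive decode loop, which would
    # run one forward pass per step and yield each new token until EOS.
    for token in ["Hello", ",", " world", "!"]:
        yield token

def stream_response(prompt: str) -> Iterator[str]:
    # Server-side: emit each token as soon as it is available
    # (e.g. as a server-sent event), rather than buffering the
    # full completion.
    yield from fake_generate_tokens(prompt)

if __name__ == "__main__":
    chunks = list(stream_response("Hi"))
    print("".join(chunks))
```

HF TGI exposes this pattern through a streaming generate endpoint; a comparable server would expose the generator above behind an HTTP response that flushes each chunk.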
-
Someone approached me on IRC and asked how to use text output in TGI. Should be easy enough, I thought. Until I tried to add some simple text output to the existing TGI sample. Apparently not o…
-
Greetings, @cipher982!
Currently we are working on the OpenVINO inference framework, and such benchmarks are critical to understanding the gaps and differences between our framework and Transformers/TGI …
-
- [ ] OpenAI
- [ ] Anthropic
- [ ] Groq
- [ ] Cohere
- [ ] Llama somehow (Ollama & Groq are fine)
-
### Feature request
Support the recent larger embedding models of 7B or more parameters (20x larger than BERT-large)
### Motivation
Embedding models have been getting much larger than before in the pas…
ai-jz updated
6 months ago
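To give a rough sense of what the feature request above implies: a 7B-parameter embedding model is roughly 20x the size of BERT-large (~340M parameters), which matters mostly for serving memory. A back-of-the-envelope sketch, assuming fp16 weights (2 bytes per parameter); the figures are approximate and the function name is hypothetical:

```python
def fp16_gib(params: float) -> float:
    # Approximate weight memory in GiB at 2 bytes per parameter (fp16),
    # ignoring activations, KV caches, and optimizer state.
    return params * 2 / 1024**3

bert_large = 340e6  # ~340M parameters
big_embed = 7e9     # ~7B parameters, ~20x BERT-large

print(f"BERT-large: ~{fp16_gib(bert_large):.1f} GiB")
print(f"7B model:   ~{fp16_gib(big_embed):.1f} GiB")
```

The jump from under 1 GiB to over 13 GiB of weights alone is why supporting these models is a distinct engineering effort rather than a drop-in change.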