-
Someone approached me on IRC and asked how to use text output in TGI. Should be easy enough, I thought. Until I tried to add some simple text output to the existing TGI sample. Apparently not o…
-
Hi - really interesting work. We're currently using HF TGI in production and exploring using this instead. Are there plans to add things like typical_p that transformers supports? Would greatly ease t…
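For context, `typical_p` in transformers enables locally typical sampling: tokens whose surprisal is closest to the entropy of the next-token distribution are kept until their cumulative probability reaches `typical_p`. A minimal pure-Python sketch of that filter (the function name and list-based shapes here are illustrative, not the actual transformers or TGI implementation):

```python
import math

def typical_filter(probs, typical_p=0.95):
    """Sketch of the locally typical sampling filter behind `typical_p`.

    Keeps the tokens whose surprisal (-log p) is closest to the entropy
    of the full distribution, until their cumulative mass >= typical_p.
    Returns the kept token indices, most typical first.
    """
    entropy = -sum(p * math.log(p) for p in probs if p > 0)
    # Score each token by how far its surprisal deviates from the entropy.
    scored = sorted(
        (abs(entropy + math.log(p)), i, p)
        for i, p in enumerate(probs)
        if p > 0
    )
    kept, cum = [], 0.0
    for _, i, p in scored:
        kept.append(i)
        cum += p
        if cum >= typical_p:
            break
    return kept
```

In practice one would apply this mask to logits before sampling, exactly as `top_p` filtering is applied, which is why exposing it in a TGI-style server is a small change.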
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a sim…
-
### Feature request
Support the recent larger embedding models of 7B or more parameters (20x larger than BERT-large)
### Motivation
Embedding models have become much larger than before in the pas…
-
Greetings, @cipher982!
Currently we are working on the OpenVINO inference framework, and such benchmarks are critical for understanding the gaps and differences between our framework and Transformers/TGI …
-
Request
The ask is to introduce an OpenAI text generation API compatibility layer (chat completion endpoint) to kserve/TGIS.
Why
Having an OpenAI API compatibility layer will allow more open sourc…
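To illustrate the kind of shim involved, here is a minimal sketch of translating an OpenAI-style chat request into a flat prompt for a plain text-generation backend. The `chat_to_prompt` helper and the role-prefix format are hypothetical; only the `messages` request shape follows the OpenAI chat completions convention, and none of this is an actual kserve/TGIS API:

```python
def chat_to_prompt(messages):
    """Hypothetical adapter: flatten OpenAI-style chat messages
    (dicts with "role" and "content") into a single prompt string
    that a plain text-generation backend can consume."""
    lines = [f"{m['role']}: {m['content']}" for m in messages]
    lines.append("assistant:")  # cue the model to produce the reply
    return "\n".join(lines)

# Example OpenAI-style request body for POST /v1/chat/completions.
request = {
    "model": "some-model",
    "messages": [
        {"role": "system", "content": "You are helpful."},
        {"role": "user", "content": "Hello"},
    ],
}
prompt = chat_to_prompt(request["messages"])
```

A real compatibility layer would additionally map sampling parameters (`temperature`, `max_tokens`, etc.) onto the backend's equivalents and wrap the generated text back into a chat-completion response object.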
-
Based on practical tests, deploying omost-llama-3-8b on an A100 using torch==2.3.0+cu118, vllm==0.5.0.post1+cu118, and xformers==0.0.26.post1+cu118 works well. If you want to speed up the process, you can ref…
-
**Describe the bug**
When changing the mapset inside a Python script, the temporal framework does not take this change into account and fails to connect to the temporal database, even though GRASS re…
-
### Feature request
Llama 3.1 is out and should be compatible with Neuron, however, it requires `transformers==4.43.1`, and `optimum-neuron` has pinned `transformers` to `4.41.1`.
Note that sin…
-
Do you have streaming functionality for auto-regressive LLMs? Something similar to Hugging Face TGI, for example.
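For comparison, TGI streams tokens to the client over server-sent events as they are generated. A minimal sketch of that pattern, assuming an SSE-style wire format (the chunk fields and the `[DONE]` sentinel here are assumptions modeled on common streaming APIs, not a specific TGI contract):

```python
import json

def stream_tokens(tokens):
    """Sketch of server-sent-events style token streaming: each
    generated token is sent as its own `data:` chunk, followed by
    a terminal sentinel so the client knows generation finished."""
    for t in tokens:
        yield f"data: {json.dumps({'token': t})}\n\n"
    yield "data: [DONE]\n\n"

# A web framework would pass this generator as a streaming response
# body; the client renders tokens as the chunks arrive.
chunks = list(stream_tokens(["Hello", ",", " world"]))
```

The point is that the server never buffers the full completion; each decode step can be flushed to the socket immediately, which is what makes interactive latency possible for auto-regressive models.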