-
I plan to implement function calling with vision models such as LLaVA and Nous-Hermes-2-Vision-Alpha based on the image, but it seems that the current implementation in the example folder only sup…
-
Based on watsonx requirements, we should expose at least these metrics:
- # of inference requests over a defined time period
- Avg. response time over a defined time period
- # of successf…
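A minimal sketch of how the first metrics could be computed from request logs; the `InferenceRecord` shape and `window_metrics` helper are hypothetical illustrations, not part of watsonx:

```python
from dataclasses import dataclass

@dataclass
class InferenceRecord:
    # Hypothetical log record: request start time (s), latency (s), success flag.
    timestamp: float
    latency: float
    ok: bool

def window_metrics(records, start, end):
    """Count requests, average response time, and successes within [start, end)."""
    in_window = [r for r in records if start <= r.timestamp < end]
    count = len(in_window)
    avg_latency = sum(r.latency for r in in_window) / count if count else 0.0
    successes = sum(1 for r in in_window if r.ok)
    return {"requests": count, "avg_response_time": avg_latency, "successful": successes}

# Example: three requests, two of which fall inside the 60 s window.
records = [
    InferenceRecord(10.0, 0.5, True),
    InferenceRecord(20.0, 1.5, True),
    InferenceRecord(95.0, 0.8, False),
]
print(window_metrics(records, 0, 60))
# → {'requests': 2, 'avg_response_time': 1.0, 'successful': 2}
```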
-
I have some suggestions for frameworks for self-hosted serving of LLMs and related models.
# Embeddings from OpenAI CLIP
Jina
https://github.com/jina-ai/clip-as-service (Apache)
# Text embeddings:
My o…
-
### Feature request
The Transformers library supports the no_repeat_ngram_size parameter for generation. https://huggingface.co/docs/transformers/v4.18.0/en/main_classes/text_generation#transformers.…
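For reference, the constraint behind `no_repeat_ngram_size` can be sketched in plain Python: at each decoding step, ban any token that would complete an n-gram already present in the generated sequence. This is a simplified illustration of the idea, not the Transformers implementation itself:

```python
def banned_tokens(generated, n):
    """Tokens that would complete an n-gram already present in `generated`.

    Simplified sketch of the no_repeat_ngram_size constraint: if the last
    n-1 tokens already appeared earlier, the tokens that followed each earlier
    occurrence are banned for the next step.
    """
    if n <= 0 or len(generated) < n - 1:
        return set()
    prefix = tuple(generated[len(generated) - (n - 1):])  # last n-1 tokens
    banned = set()
    for i in range(len(generated) - n + 1):
        if tuple(generated[i:i + n - 1]) == prefix:
            banned.add(generated[i + n - 1])
    return banned

# With n=2, after "A B A" the bigram (A, B) already occurred, so "B" is banned next.
print(banned_tokens(["A", "B", "A"], 2))  # → {'B'}
```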
-
### Feature request
Support recent, larger embedding models with 7B or more parameters (about 20x larger than BERT-large).
### Motivation
Embedding models have become much larger than before in the pas…
ai-jz updated
4 months ago
-
I brought up the ChatQnA UI with all the containers.
### Issue 1. Huggingface download update
The Hugging Face TGI container was downloading the model; it took a long time, around ~12 min, for Intel/Neural cha…
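One common mitigation, documented in the TGI README, is to download the weights once into a local directory and mount it into the container, so restarts and redeploys reuse the cache instead of re-downloading. A sketch with a placeholder model id (substitute the deployment's actual model):

```shell
# Placeholder model id; replace with the model this deployment uses.
model=your-org/your-model
volume=$PWD/data   # weights persist here across container restarts

docker run --shm-size 1g -p 8080:80 \
  -v $volume:/data \
  ghcr.io/huggingface/text-generation-inference:latest \
  --model-id $model
```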
-
Hi guys, love your project!
I was wondering if you could add support for Mistral via:
- [TGI](https://github.com/huggingface/text-generation-inference)
- [vLLM](https://github.com/vllm-project/vllm)…
-
Repo used for testing
https://github.com/opea-project/GenAIExamples/tree/main/ChatQnA/docker/xeon
OPEA Project Errors
2. Embedding Microservice
curl : Internal Server Error
At line:1 char:1…
-
Background:
When TGI is adapted to lightllm and a model is loaded across multiple GPUs, one process is spawned per GPU, and each process loads the entire model into memory.
When the model files are large, e.g. 65B+ models, loading on 8 GPUs would require 8*130 GB of memory, which is clearly unreasonable and leads to OOM.
Proposed solution:
lightllm could provide a load_from_weight_dict(weight_dict) interface. The TGI layer would pass in a weight dict…
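A sketch of what the proposed flow around the (hypothetical, not yet existing) `load_from_weight_dict(weight_dict)` interface could look like: the caller loads the checkpoint once, slices out each rank's shard, and hands each worker only the tensors it needs, so no single process holds the full model. The `shard_weight_dict` helper below is an illustration using plain lists in place of tensors:

```python
def shard_weight_dict(weight_dict, rank, world_size):
    """Slice each weight along its first axis for one tensor-parallel rank.

    Hypothetical helper for the proposed load_from_weight_dict(weight_dict)
    interface: load the checkpoint once, then pass each worker only its shard
    instead of having every process load the entire model.
    """
    shard = {}
    for name, w in weight_dict.items():
        rows = len(w)
        per_rank = rows // world_size  # assume rows divide evenly for this sketch
        shard[name] = w[rank * per_rank:(rank + 1) * per_rank]
    return shard

# Toy checkpoint: one 4-row "weight" split across 2 ranks.
ckpt = {"layer0.weight": [[1, 2], [3, 4], [5, 6], [7, 8]]}
print(shard_weight_dict(ckpt, rank=0, world_size=2))  # → {'layer0.weight': [[1, 2], [3, 4]]}
print(shard_weight_dict(ckpt, rank=1, world_size=2))  # → {'layer0.weight': [[5, 6], [7, 8]]}
```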
-
Request
The ask is to introduce an OpenAI text generation API compatibility layer (chat completion endpoint) to kserve/TGIS.
Why
Having an OpenAI API compatibility layer will allow more open sourc…
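To illustrate the shape of such a layer, here is a minimal translation from an OpenAI-style chat-completions request body to a plain text-generation call and back into the OpenAI response schema. The chat template and `generate_fn` hook are placeholders, not the kserve/TGIS API:

```python
import time
import uuid

def chat_completion(body, generate_fn):
    """Translate an OpenAI-style /v1/chat/completions body to a text-generation
    backend and wrap the output in the OpenAI chat.completion response schema.

    `generate_fn(prompt, max_new_tokens)` is a placeholder for the backend's
    generate call; the role-prefixed template is a simplified stand-in.
    """
    prompt = "".join(f"{m['role']}: {m['content']}\n" for m in body["messages"])
    prompt += "assistant:"
    text = generate_fn(prompt, max_new_tokens=body.get("max_tokens", 256))
    return {
        "id": f"chatcmpl-{uuid.uuid4().hex[:12]}",
        "object": "chat.completion",
        "created": int(time.time()),
        "model": body.get("model", "unknown"),
        "choices": [{
            "index": 0,
            "message": {"role": "assistant", "content": text},
            "finish_reason": "stop",
        }],
    }

# Toy backend that ignores the prompt; shows the request/response shapes.
resp = chat_completion(
    {"model": "demo", "messages": [{"role": "user", "content": "hi"}]},
    generate_fn=lambda prompt, max_new_tokens: "hello!",
)
print(resp["choices"][0]["message"]["content"])  # → hello!
```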