-
# Proposed Feature
Add an efficient interface for generation probabilities on fixed prompt and completion pairs. For example:
```python
# ... load LLM or engine
prompt_completion_pairs = [
…
-
I tried to deploy an embedding model to an AWS SageMaker endpoint using the provided guide, which uses inference.py to deploy custom code. The endpoint is created and starts, but when I query the end-p…
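For reference, here is a minimal `inference.py` sketch following the SageMaker inference toolkit's handler contract (`model_fn` / `input_fn` / `predict_fn` / `output_fn`). The embedding model is a trivial stand-in, and the JSON request/response shapes are assumptions, not the guide's exact format:

```python
import json

def model_fn(model_dir):
    # Load your embedding model from model_dir; a real handler would do e.g.
    # SentenceTransformer(model_dir). A trivial stand-in for illustration:
    return lambda texts: [[float(len(t))] for t in texts]

def input_fn(request_body, content_type="application/json"):
    # Deserialize the request; reject content types the handler can't parse.
    if content_type != "application/json":
        raise ValueError(f"unsupported content type: {content_type}")
    return json.loads(request_body)["inputs"]

def predict_fn(inputs, model):
    # Run the loaded model on the deserialized inputs.
    return model(inputs)

def output_fn(prediction, accept="application/json"):
    # Serialize the embeddings back to the client.
    return json.dumps({"embeddings": prediction})
```

If a handler function raises (or is missing) at load time, the endpoint can report "InService" yet fail every invocation, so checking the CloudWatch logs for the container is usually the first debugging step.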
-
### System Info
- text-generation-inference version: 2.2.0
- model: "mistralai/Mixtral-8x7B-Instruct-v0.1"
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An officially supported c…
-
Hello, thank you for sharing your work.
I have an issue regarding text editing: I want to run only the text-editing part on some data I have, and I tried to use the inference code. Th…
-
When using: **Mistral 7b Text Completion - Raw Text training full example.ipynb**
**Last block errors with:**
```
Exception in thread Thread-17 (generate):
Traceback (most recent call last):
  File…
```
-
### System Info
A100-80GB * 4
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An officially supported command
- [ ] My own modifications
### Reproduction
```shell
docker ru…
```
-
Dear all,
I failed to run Llama-2-7b-chat-hf on an NPU; please give me a hand.
1. I converted the model with the command below and got two models,
a) optimum-cli export openvino --task text-generation -m Meta-…
-
### Related issues
_No response_
### Possible solution
Repository: https://github.com/huggingface/text-generation-inference
API documentation: https://huggingface.github.io/text-generation-inference/
### Help with development
- [ ] I'm willing to help with development!
### Additional …
-
Hello,
I followed the system setup instructions and tried to build the text-generation-inference container on my Jetson Orin 8GB running JetPack 5.1, but I seem to be running into the following err…
-
### Feature request
Currently a new trace is created for each HTTP request. It would be useful if the trace were taken (when available) from the request's `traceparent` header, as defined in https://opentelem…
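The W3C Trace Context `traceparent` header the request refers to has a fixed shape: `version-traceid-parentid-flags` with 2, 32, 16, and 2 lowercase hex digits respectively. A hedged sketch of extracting it from an incoming request, so the server could continue the caller's trace instead of starting a new one (the function name is illustrative, not TGI's API):

```python
import re

# traceparent = version(2) - trace-id(32) - parent-id(16) - flags(2), all hex.
TRACEPARENT_RE = re.compile(
    r"^(?P<version>[0-9a-f]{2})-(?P<trace_id>[0-9a-f]{32})"
    r"-(?P<parent_id>[0-9a-f]{16})-(?P<flags>[0-9a-f]{2})$"
)

def parse_traceparent(header):
    """Return (trace_id, parent_id, sampled) or None if the header is invalid."""
    m = TRACEPARENT_RE.match(header.strip())
    # All-zero trace-id / parent-id values are invalid per the spec.
    if not m or m["trace_id"] == "0" * 32 or m["parent_id"] == "0" * 16:
        return None
    sampled = int(m["flags"], 16) & 0x01 == 1
    return m["trace_id"], m["parent_id"], sampled

parse_traceparent("00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01")
# → ("4bf92f3577b34da6a3ce929d0e0e4736", "00f067aa0ba902b7", True)
```

Returning `None` on a malformed header matches the spec's guidance to start a fresh trace rather than fail the request.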