-
### System Info
* docker image: ghcr.io/huggingface/text-generation-inference:2.0.2
* docker image: ghcr.io/huggingface/text-generation-inference:2.1.1
### Information
- [X] Docker
- [ ] The CLI…
-
![image](https://github.com/user-attachments/assets/a5c487e9-97a5-4367-8980-32e2ce129a38)
Running webui.py in Docker on the Windows platform:
```
Traceback (most recent call last):
File "/usr/local/lib/python3.8/di…
-
### Description
The cohere rerank implementation allows configuring fields that probably don't apply. The implementation leverages the common settings here: https://github.com/elastic/elasticsearch/b…
-
![image](https://github.com/user-attachments/assets/9686e584-0af5-447a-88d1-b27bff5262e8)
---------------------------------------------------------------------------
ModelError …
-
When I use the API method for inference, I get an error when using multiple GPUs.
I also noticed that api.py imports run_old.py; does that mean it cannot use multiple GPUs?
-
I want to perform inference on quantized LLAMA (W8A16) on ARM-v9 (with SVE) using oneDNN. The LLAMA weights are per-group quantized.
Based on my understanding, I need to prepack the weights to redu…
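As background on what "per-group quantized" means here, each contiguous group of weights gets its own scale factor. A minimal plain-Python sketch (function names are illustrative, not part of oneDNN) of symmetric int8 per-group quantization:

```python
def quantize_per_group(weights, group_size):
    """Quantize a flat list of float weights to int8 values,
    using one symmetric scale per group of `group_size` weights."""
    q, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        # Symmetric scale: map the largest magnitude in the group to 127.
        scale = max(abs(w) for w in group) / 127.0 or 1.0
        scales.append(scale)
        q.extend(max(-128, min(127, round(w / scale))) for w in group)
    return q, scales

def dequantize_per_group(q, scales, group_size):
    """Recover approximate float weights from int8 values and group scales."""
    return [v * scales[i // group_size] for i, v in enumerate(q)]
```

For W8A16, the int8 values above would be dequantized (or fused into the matmul) against fp16 activations; prepacking then stores the quantized weights in the layout the matmul primitive prefers.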
-
Got the error "Error Building Component
Error building vertex Hugging Face API: Failed to resolve model_id:Could not find model id for inference server: https://api-inference.huggingface.co/models/mi…
-
There seems to be an issue running the model on huggingface (https://huggingface.co/Norm/nougat-latex-base), as responses seem to be cut short. Take for example this image:
![Screenshot 2023-11-29 …
-
It seems that move_to_gpu & move_to_cpu are not working as expected in the fast_inference branch.
https://github.com/RVC-Boss/GPT-SoVITS/blob/fast_inference_/api_v3.py#L327-L343
It will alway…
-
Hi there,
I believe I almost have this all figured out, and it's working great. One issue I'm having is that after inferring once using the API, memory usage stays very high (12.7 GB out of 16), e…
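Not a diagnosis of this particular report, but high resident memory after an API call often comes from the server keeping references to large inference buffers in a cache. A minimal sketch of the usual mitigation pattern (the `cache` dict and function name are hypothetical, not from this project):

```python
import gc

def release_inference_buffers(cache):
    """Drop references to large buffers held from a previous request
    and force a garbage-collection pass so memory can be reclaimed.
    `cache` is a hypothetical dict holding arrays/tensors from inference."""
    cache.clear()
    gc.collect()
    # If the backend is PyTorch on CUDA, following this with
    # torch.cuda.empty_cache() returns freed blocks to the driver,
    # which is what shows up in nvidia-smi / system monitors.
```

Note that even after this, allocators often keep freed pages resident, so reported process memory may stay above the true live working set.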