-
### Search before asking
- [X] I have searched the Inference [issues](https://github.com/roboflow/inference/issues) and found no similar bug report.
### Bug
## Set Up
I use a Basler Camera acA1…
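(The rest of the setup is truncated, but a typical Basler capture path uses pypylon; a minimal sketch, assuming the first attached device is the camera in question — the device selection and single-frame grab here are illustrative, not the reporter's code:)
```python
# Minimal pypylon capture sketch (assumed setup; the reporter's exact
# camera model and pipeline are cut off above).
from pylon import pylon  # pip package: pypylon
from pypylon import pylon

# Attach to the first Basler device found on the USB/GigE bus.
camera = pylon.InstantCamera(pylon.TlFactory.GetInstance().CreateFirstDevice())
camera.Open()
camera.StartGrabbing(pylon.GrabStrategy_LatestImageOnly)
try:
    grab = camera.RetrieveResult(5000, pylon.TimeoutHandling_ThrowException)
    if grab.GrabSucceeded():
        frame = grab.Array  # numpy array, ready to hand to an inference call
    grab.Release()
finally:
    camera.StopGrabbing()
    camera.Close()
```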
-
Hi,
I've been trying to serve different Phi3 models using the llama.cpp server created by ipex-llm's `init-llama-cpp`.
When I serve with this version I have two problems:
1) The server doesn…
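(For context, a llama.cpp server exposes an OpenAI-compatible endpoint; a minimal smoke test, assuming the server is already listening on localhost:8080 — the port and prompt are assumptions:)
```python
# Smoke-test a running llama.cpp server via its OpenAI-compatible API.
# Host, port, and prompt are assumptions; adjust to your setup.
import json
import urllib.request

payload = {
    "messages": [{"role": "user", "content": "Say hello in one sentence."}],
    "max_tokens": 64,
}
req = urllib.request.Request(
    "http://localhost:8080/v1/chat/completions",
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.load(resp)["choices"][0]["message"]["content"])
```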
-
### Search before asking
- [X] I have searched the HUB [issues](https://github.com/ultralytics/hub/issues) and found no similar bug report.
### HUB Component
Inference
### Bug
3 issues listed be…
-
**Describe the package you'd like added**
`vllm` has become a popular inference server for LLMs: https://github.com/vllm-project/vllm
**Describe how this package fits in with the project**
GenAI/…
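(For reference, vLLM's offline Python entry point is only a few lines; a minimal sketch — the model name is an arbitrary small example, not taken from the request above:)
```python
# Minimal vLLM offline-inference sketch; the model id is an example placeholder.
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")  # small model for a quick check
params = SamplingParams(temperature=0.8, max_tokens=32)
outputs = llm.generate(["The capital of France is"], params)
for out in outputs:
    print(out.outputs[0].text)
```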
-
When I use a Faster R-CNN TRT model with the inference server, no error is reported and it works well. But I found a strange phenomenon: when I try to send a series of pictures to the model at the same time, i…
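(The concurrent-send pattern described here is usually done with Triton's client-side async API; a hedged sketch, assuming an HTTP endpoint on localhost:8000 and a model named `faster_rcnn` with a single FP32 input named `input` — all of those names and shapes are assumptions:)
```python
# Send several images to a Triton model concurrently via async_infer.
# URL, model name, input name, and shape are assumptions for illustration.
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000", concurrency=4)

pending = []
for _ in range(4):
    img = np.random.rand(1, 3, 800, 800).astype(np.float32)
    inp = httpclient.InferInput("input", list(img.shape), "FP32")
    inp.set_data_from_numpy(img)
    pending.append(client.async_infer("faster_rcnn", inputs=[inp]))

# Collect results; each get_result() blocks until that request completes.
results = [p.get_result() for p in pending]
```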
-
volodya
High
# Forecast-implied inferences can be set to any value because ForecastElements is not filtered for duplicates
## Summary
forecast-implied inferences can be set to any value due to Foreca…
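(The fix this finding implies is a duplicate filter before the elements are combined; a generic first-wins dedup-by-key sketch — the `inferer` field name and the values are hypothetical:)
```python
# Generic first-wins dedup by key; the `inferer` key field is hypothetical.
def filter_duplicates(elements):
    seen = set()
    unique = []
    for elem in elements:
        if elem["inferer"] not in seen:
            seen.add(elem["inferer"])
            unique.append(elem)
    return unique

elements = [
    {"inferer": "alice", "value": 1.0},
    {"inferer": "alice", "value": 9999.0},  # a duplicate would skew the result
    {"inferer": "bob", "value": 2.0},
]
assert filter_duplicates(elements) == [elements[0], elements[2]]
```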
-
- Environment
  - docker: registry.cn-hangzhou.aliyuncs.com/havenask/rtp_llm:0.1.13_cuda12
  - cuda: 12.1
  - driver: 515.105.01
- Model:
  - llama: https://huggingface.co/lmsys/vicuna-33b-v1.3
…
-
There are two definitions of `gen_random_start_ids` in tools/utils/utils.py:
https://github.com/triton-inference-server/tensorrtllm_backend/blob/ae52bce3ed8ecea468a16483e0dacd3d156ae4fe/tools/utils/utils.py#L238-L…
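(The effect of that duplication is silent: in Python a second `def` simply rebinds the name, so the earlier implementation becomes dead code. A self-contained illustration:)
```python
# In Python, a second `def` with the same name silently shadows the first.
def gen_random_start_ids():
    return "first definition"

def gen_random_start_ids():
    return "second definition"

# Only the later definition survives; the earlier one is unreachable.
print(gen_random_start_ids())  # -> "second definition"
```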
-
Hi there,
First, thank you for unsloth, it's great!
I've finetuned a llama-3-8b-Instruct-bnb-4bit model and pushed it to the HF Hub. When I try to deploy it using [hf Inference Endpoints](https://huggingfa…
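(A useful local sanity check before debugging the Endpoint is to load the pushed 4-bit checkpoint with transformers + bitsandbytes; a sketch — the repo id is a placeholder, not the reporter's actual model:)
```python
# Load a bnb-4bit checkpoint locally; the repo id is a placeholder.
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

repo = "your-user/llama-3-8b-Instruct-bnb-4bit"  # placeholder repo id
quant = BitsAndBytesConfig(load_in_4bit=True)

tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(
    repo, quantization_config=quant, device_map="auto"
)

inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))
```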
-
Tracking the second round of issues submitted to [triton-inference-server](https://github.com/triton-inference-server/server):
- [ ] https://github.com/triton-inference-server/server/issues/2018: Con…