inference-server Search Results

1000+ results
for inference-server

Best match


triton-inference-server/model_navigator #33

TensorRT-LLM Triton Backend Support

When can NAV support creating Triton Repo for this new backend? Is it on your roadmap? https://github.com/triton-inference-server/tensorrtllm_backend

shixianc updated 3 months ago
6
immich-app/immich #12930

Getting error in ASGI failed to allocate memory

### The bug I am just looking at my logs because of an issue I am having with facial recognition, these errors are unrelated as they happened during the night, but I wanted to draw some attention to …

rayzorben updated 1 month ago
5
mlflow/mlflow #13420

[FR] Add option to copy relative directory structure when sp…

### Willingness to contribute Yes. I would be willing to contribute this feature with guidance from the MLflow community. ### Proposal Summary When logging a model using `mlflow.pyfunc.log_model`, …

djalusic updated 3 weeks ago
2
elastic/elasticsearch #116443

[CI] MlWithSecurityIT test {yaml=ml/inference_crud/Test forc…

**Build Scans:** - [elasticsearch-periodic #4769 / openjdk23_checkpart4_java-matrix](https://gradle-enterprise.elastic.co/s/6ttsee3pmzlnc) - [elasticsearch-pull-request #40154 / part-4](https://gradle…

elasticsearchmachine updated 1 week ago
2
SeungjunNah/DeepDeblur-PyTorch #54

Inference error

1. I used this command for inference but encountered an issue. Does anyone know how to fix this? - command: `python launch.py --n_GPUs 1 main.py --batch_size 8 --precision single` - error : `[W socke…

davidvct updated 7 months ago
3
roboflow/inference #419

`DocTR` model output missing important information that mode…

### Search before asking - [X] I have searched the Inference [issues](https://github.com/roboflow/inference/issues) and found no similar feature requests. ### Description `DocTR` produces not only…

PawelPeczek-Roboflow updated 3 months ago
1
roboflow/roboflow-python #144

predict() for URLs sets image_dims to "Undefined"

When calling `model.predict('https://example.com/test.jpg')` with a URL the response contains: ``` results.image_dims {'width': 'Undefined', 'height': 'Undefined'} ``` Which is unfortunate sin…

saschwarz updated 1 month ago
2
QwenLM/Qwen2-VL #71

vLLM inference output is all exclamation marks

Following the installation tutorial, vLLM fails; the GPU is an H100 and the image is the latest one, pulled last night. 1. no module 'Qwen2-7B-Instruct', python -m vllm.entrypoints.openai.api_server --served-model-name Qwen2-VL-7B-Instruct --model model_path chat_response = …

wangyongbing updated 1 month ago
7
bigscience-workshop/petals #587

Petals doesn't deal with server failure properly

Hi there, we'd like to report our findings on testing Petals' availability of fault tolerance. We note that the current implementation of the method _step_ in the class __ServerInferenceSession_ fr…

oldcpple updated 3 months ago
4
huggingface/text-generation-inference #2615

Excessive use of VRAM for Llama 3.1 8B

### System Info - text-generation-inference:2.3.0, deployed on docker - model info: { "model_id": "meta-llama/Llama-3.1-8B-Instruct", "model_sha": "0e9e39f249a16976918f6564b8830bc894c89659…

ukito-pl updated 1 month ago
1
