-
When can NAV support creating Triton Repo for this new backend? Is it on your roadmap?
https://github.com/triton-inference-server/tensorrtllm_backend
-
### The bug
I am just looking at my logs because of an issue I am having with facial recognition, these errors are unrelated as they happened during the night, but I wanted to draw some attention to …
-
### Willingness to contribute
Yes. I would be willing to contribute this feature with guidance from the MLflow community.
### Proposal Summary
When logging a model using `mlflow.pyfunc.log_model`, …
-
**Build Scans:**
- [elasticsearch-periodic #4769 / openjdk23_checkpart4_java-matrix](https://gradle-enterprise.elastic.co/s/6ttsee3pmzlnc)
- [elasticsearch-pull-request #40154 / part-4](https://gradle…
-
1. I used this command for inference but encountered issue. Anyone knows how to fix this?
- command: `python launch.py --n_GPUs 1 main.py --batch_size 8 --precision single`
- error :
`[W socke…
-
### Search before asking
- [X] I have searched the Inference [issues](https://github.com/roboflow/inference/issues) and found no similar feature requests.
### Description
`DocTR` produces not only…
-
When calling `model.predict('https://example.com/test.jpg)` with a URL the response contains:
```
results.image_dims
{'width': 'Undefined', 'height': 'Undefined'}
```
Which is unfortunate sin…
-
安装教程,使用vllm出错,显卡H100 , 昨天晚上拉的最新镜像
1、no module 'Qwen2-7B-Instruct',
python -m vllm.entrypoints.openai.api_server --served-model-name Qwen2-VL-7B-Instruct --model model_path
chat_response = …
-
Hi there, we'd like to report our findings on testing Petals' availability of fault tolerance.
We note that the current implementation of the method _step_ in the class __ServerInferenceSession_ fr…
-
### System Info
- text-generation-inference:2.3.0, deployed on docker
- model info:
{
"model_id": "meta-llama/Llama-3.1-8B-Instruct",
"model_sha": "0e9e39f249a16976918f6564b8830bc894c89659…