-
***Under Construction***
The Answer Engine, released in version 0.13, provides a Q&A interface for Tabby's users to interact with the LLM, optionally within the context of a connected repository. T…
-
**Description**
I'm trying to build with my pending core PR using:
```shell
./build.py \
-v \
--no-container-build \
--repo-tag=core:pull/108/head
```
It fails with:
```
Cloning into…
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### W…
-
In plumbing through the Security Assistant Knowledge Base API integration tests (https://github.com/elastic/kibana/pull/192665), I ended up having to add support for a `modelId` override to our API's …
-
I followed the [document](https://docs.mlcommons.org/inference/benchmarks/image_classification/resnet50) to run inference with ResNet50, using MLCommons-Python -> edge -> TensorFlow -> CUDA -> Native
The comm…
-
vLLM 0.6.2 was released just a few hours ago, and it says multi-image inference with Qwen2-VL is not supported.
I've tried it, but it requires the newest transformers and installs it automatically.
When I start it u…
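For reference, a minimal sketch of the kind of multi-image call being attempted, based on vLLM's documented offline `multi_modal_data` input format; the model tag, image paths, prompt placeholders, and the `limit_mm_per_prompt` value are my own assumptions, not taken from the report:
```python
# Sketch only (not verified against 0.6.2): pass several PIL images for one
# prompt via vLLM's multi_modal_data input. Model tag, file names, and the
# chat-template placeholders below are assumptions for illustration.
from PIL import Image
from vllm import LLM, SamplingParams

llm = LLM(
    model="Qwen/Qwen2-VL-7B-Instruct",
    limit_mm_per_prompt={"image": 2},  # allow two images per prompt
)

prompt = (
    "<|im_start|>user\n"
    "<|vision_start|><|image_pad|><|vision_end|>"
    "<|vision_start|><|image_pad|><|vision_end|>"
    "What differs between the two images?<|im_end|>\n"
    "<|im_start|>assistant\n"
)

outputs = llm.generate(
    {
        "prompt": prompt,
        "multi_modal_data": {"image": [Image.open("a.jpg"), Image.open("b.jpg")]},
    },
    SamplingParams(max_tokens=128),
)
print(outputs[0].outputs[0].text)
```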
-
I'm following the instructions in the Startup example, and
`pip3 install -r ./open_source/deps/requirements_torch_gpu_cuda12.txt` completed successfully, but when I installed maga_transformer-0.2.0+cud…
-
Hi,
when running the tutorial `OnnxRuntimeServerSSDModel.ipynb`, I get this response from the server:
```python
response = requests.post(inference_url, headers=request_headers, data=request_messa…
-
### System Info
Docker image: nvcr.io/nvidia/tritonserver:24.07-trtllm-python-py3
Device: 8x H100
trt-llm backend: v0.11.0
### Who can help?
@byshiue @schetlur-nv
### Information
- [ ] The off…
-
**Describe the bug**
Getting a zombie-process exception, as already reported for the [sagemaker-inference-toolkit](https://github.com/aws/sagemaker-inference-toolkit/pull/133)
**To reproduce**
Using…
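Separate from the truncated repro steps above, and not the toolkit's actual fix: for context, a minimal POSIX-style sketch of the child-reaping pattern that avoids zombie processes (the fork/sleep timing is purely illustrative):
```python
import os
import signal
import time

def reap_children(signum, frame):
    # Reap any exited children so they do not linger as zombies.
    while True:
        try:
            pid, _ = os.waitpid(-1, os.WNOHANG)
        except ChildProcessError:
            return  # no children left
        if pid == 0:
            return  # children exist but none have exited yet

signal.signal(signal.SIGCHLD, reap_children)

# Fork a short-lived child; without the handler above it would stay in the
# process table as a zombie until the parent calls wait() on it.
if os.fork() == 0:
    os._exit(0)   # child exits immediately
time.sleep(1)     # parent keeps running; the handler reaps the child
```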