-
### System Info
I am currently working with the ghcr.io/huggingface/text-embeddings-inference:cpu-1.2 Docker image on macOS. At the moment, I am only trying to find out which reranker models with …
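For reference, a reranker model served by text-embeddings-inference is queried through the `/rerank` route rather than `/embed`. Below is a minimal sketch, assuming the container is published on port 8080 and loaded with an example reranker such as BAAI/bge-reranker-base; the model id and port are illustrative, not taken from the report.

```python
# Minimal sketch: calling the TEI /rerank endpoint from Python.
# Assumed launch command (model id and port are examples):
#   docker run -p 8080:80 ghcr.io/huggingface/text-embeddings-inference:cpu-1.2 \
#       --model-id BAAI/bge-reranker-base
import requests

resp = requests.post(
    "http://localhost:8080/rerank",
    json={
        "query": "What is Deep Learning?",
        "texts": ["Deep Learning is a subfield of ML.", "Paris is in France."],
    },
)
resp.raise_for_status()

# Each entry carries the index of the candidate text and its relevance score.
for item in resp.json():
    print(item["index"], item["score"])
```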
-
When migrating from automatic1111 to diffusers, I'm experiencing a significant degradation in image quality despite using the same parameters. The images generated with diffusers are of noticeably low…
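The usual suspects for this kind of gap are the sampler, step count, CFG scale, and seed handling, which A1111 and diffusers configure differently. A minimal sketch of pinning these down in diffusers follows; the checkpoint id, device, and all parameter values are illustrative, not taken from the report.

```python
# Minimal sketch: align the diffusers scheduler, steps, CFG scale, and seed
# with the settings used in A1111 before comparing image quality.
import torch
from diffusers import StableDiffusionPipeline, DPMSolverMultistepScheduler

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16  # example checkpoint
).to("cuda")

# Approximate A1111's "DPM++ 2M Karras" sampler.
pipe.scheduler = DPMSolverMultistepScheduler.from_config(
    pipe.scheduler.config, use_karras_sigmas=True
)

image = pipe(
    prompt="a photo of an astronaut riding a horse",
    negative_prompt="blurry, low quality",
    num_inference_steps=30,
    guidance_scale=7.0,
    generator=torch.Generator("cuda").manual_seed(1234),  # fixed seed for A/B comparison
).images[0]
image.save("out.png")
```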
-
Hello, I have no idea what I'm doing. I'm trying to run on an ARM64 system, and I get the following errors:
I ran the loadgen install and everything worked fine, but running the vision benchmarks fai…
-
### Describe the issue
Hey
We are planning to add GPU inference (using Microsoft.ML.OnnxRuntime.Gpu 1.17.0) as an option in our C# software.
However, when switching from the CPU ONNX runtime to th…
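When the CUDA execution provider cannot initialise, ONNX Runtime silently falls back to CPU, so the first thing to verify is which providers are actually attached to the session. A minimal sketch follows, shown in Python for brevity (the Microsoft.ML.OnnxRuntime.Gpu C# API exposes the same session/provider concepts); the model path and input shape are illustrative assumptions.

```python
# Minimal sketch: request the CUDA provider and confirm it is actually active.
import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession(
    "model.onnx",  # placeholder path
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)

# If CUDA failed to load, onnxruntime falls back to CPU without raising,
# so inspect the providers that were really attached.
print(sess.get_providers())

inputs = {sess.get_inputs()[0].name: np.zeros((1, 3, 224, 224), dtype=np.float32)}
outputs = sess.run(None, inputs)
```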
-
Hi, thanks for such robust work!
We now support the ArenaHard dataset in OpenCompass. OpenCompass is an evaluation platform that can partition tasks and support different model inference backend…
-
### Describe the issue
I'm using A1111 and an extension to mask the background. When I try to run the generation to get the mask, I run into some issues. Since there are no install instructions anywher…
-
Hi! Very impressive project!
My main goal is to export the model to an intermediate format and test how well it can be accelerated on many platforms. I am trying to accelerate the assembled convolution module for be…
-
Hello,
I am seeking advice on the best practices for tracking all inputs and predictions made by a model when using Triton Inference Server. Specifically, I would like to track every interaction th…
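One simple pattern is to wrap every client-side call so the request and the returned prediction are written to an audit log before being handed back to the application. A minimal sketch using the Triton Python HTTP client follows; the server URL, model name, and tensor names are illustrative assumptions, and the JSONL file could be swapped for a database or message queue.

```python
# Minimal sketch: client-side wrapper that logs every input/prediction pair.
import json
import time

import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")  # assumed endpoint


def infer_and_log(model_name: str, array: np.ndarray,
                  log_path: str = "predictions.jsonl") -> np.ndarray:
    infer_input = httpclient.InferInput("INPUT__0", list(array.shape), "FP32")
    infer_input.set_data_from_numpy(array)

    result = client.infer(model_name, inputs=[infer_input])
    prediction = result.as_numpy("OUTPUT__0")

    # Append one record per interaction.
    with open(log_path, "a") as f:
        f.write(json.dumps({
            "timestamp": time.time(),
            "model": model_name,
            "input": array.tolist(),
            "prediction": prediction.tolist(),
        }) + "\n")
    return prediction
```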
-
🦥 Unsloth: Will patch your computer to enable 2x faster free finetuning.
==((====))== Unsloth: Fast Llama patching release 2024.6
\\ /| GPU: NVIDIA A100 80GB PCIe MIG 7g.80gb. Max memory: 7…
-
### Describe the issue
I designed and trained a 6D pose estimation model using PyTorch. After that, I used torch.onnx.export to convert the .pth parameter file into an ONNX inference fi…
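For context, a typical export path looks like the sketch below; the placeholder network, input resolution, output names, and opset version stand in for the actual 6D pose model and are not taken from the issue.

```python
# Minimal sketch: load trained weights into the model and export it to ONNX.
import torch
import torch.nn as nn


class PoseNet(nn.Module):
    """Placeholder standing in for the trained 6D pose network (hypothetical)."""

    def __init__(self):
        super().__init__()
        self.backbone = nn.Conv2d(3, 16, 3, padding=1)
        self.head = nn.Linear(16, 7)  # e.g. quaternion (4) + translation (3)

    def forward(self, x):
        feat = self.backbone(x).mean(dim=(2, 3))
        return self.head(feat)


model = PoseNet()
# In the real workflow the weights come from the trained .pth file:
# model.load_state_dict(torch.load("pose_model.pth", map_location="cpu"))
model.eval()

dummy_input = torch.randn(1, 3, 480, 640)  # example input resolution
torch.onnx.export(
    model,
    dummy_input,
    "pose_model.onnx",
    input_names=["image"],
    output_names=["pose"],
    opset_version=17,
    dynamic_axes={"image": {0: "batch"}},  # allow variable batch size
)
```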