-
Traceback:
File "D:\NasD\stable-diffusion-webui\modules\call_queue.py", line 56, in f
res = list(func(*args, **kwargs))
File "D:\NasD\stable-diffusion-webui\modules\call_queue.py", line 3…
-
Hello,
I am seeking advice on the best practices for tracking all inputs and predictions made by a model when using Triton Inference Server. Specifically, I would like to track every interaction th…
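One lightweight approach, independent of any Triton-specific API, is to wrap whatever inference callable you use in an audit-logging decorator that appends each request and response to a JSON-lines file. The sketch below is a minimal illustration under that assumption; `infer_fn` is a hypothetical stand-in for your actual client call (e.g. a function that forwards to the Triton client), not a Triton API:

```python
import json
import time
import uuid
from pathlib import Path


def with_audit_log(infer_fn, log_path):
    """Wrap an inference callable so every call's inputs and outputs
    are appended to a JSON-lines audit log.

    `infer_fn` is a hypothetical interface: it is assumed to take and
    return JSON-serializable data.
    """
    log_file = Path(log_path)

    def wrapped(inputs):
        # Record a unique id and timestamp alongside the raw inputs.
        record = {
            "id": str(uuid.uuid4()),
            "ts": time.time(),
            "inputs": inputs,
        }
        outputs = infer_fn(inputs)
        record["outputs"] = outputs
        # Append one JSON object per line; appends keep earlier records.
        with log_file.open("a", encoding="utf-8") as f:
            f.write(json.dumps(record) + "\n")
        return outputs

    return wrapped
```

You could then wrap the call site once (e.g. `infer = with_audit_log(my_triton_call, "audit.jsonl")`) and every interaction is captured without touching the model server itself. For high-throughput deployments a sidecar or Triton's own request tracing may be more appropriate, but a JSONL log is easy to inspect and replay.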
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
When migrating from automatic1111 to diffusers, I'm experiencing a significant degradation in image quality despite using the same parameters. The images generated with diffusers are of noticeably low…
-
**Description**
CUDA Graph does not work in the tensorrt backend. The model config is as follows:
```
platform: "tensorrt_plan"
version_policy: { latest: { num_versions: 2}}
parameters { key: "execution_mode"…
-
### Describe the issue
I'm using A1111 and an extension to mask the background. When I try to run the generation to get the mask, I run into some issues. Since there are no install instructions anywher…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
When using controlnet with any models, inference fails wi…
-
### Description
Current behavior:
Both `docker compose create` and `docker compose --verbose create` silently fail to use a local image, and then loudly fail downstream when Compose can't pull that …
-
When executing the script `examples/offline_inference_with_prefix.py`, it calls `context_attention_fwd` from `vllm.model_executor.layers.triton_kernel.prefix_prefill`, which triggers the following er…
-
Am I safe to assume that DeepSpeed does not yet support ROCm 6.0? I'm seeing a whole lot of errors during the JIT build of transformer_inference.
```
$ pip show torch
Name: torch
Version: 2.3.0+rocm6.0
```
…