-
Getting a big whopper of an error trying to apply the optimizations shown in the [directions for CogVideoX](https://huggingface.co/THUDM/CogVideoX-2b)
```
-----------------------------------------…
-
Hii there, I cloned this repo to run deepstream_ignition_IP_File_rtsp_yolo.py file , but it turns out that yolo.weights file is missing can you please upload or share the link to it...
-
Based on my understanding, a quantized model (e.g., INT8 version) should run faster than an FP32 model, since the hardware has a specific acceleration unit for INT8 data computation. But the experimen…
-
We have a few questions specific to KFS V2 based GRPC method for inference.
1. Is KFS V2 also meant to support tabular data based payload or only for tensor based ML/DL workload ?
2. As KFS V2 …
-
It would be great to get the instructions to run the 3B model locally on a gaming GPU (e.g. 3090/4090 with 24GB VRAM).
### Confirmed GPUs
From this thread
| GPU Model | VRAM (GB) | Tuned-3b | T…
-
We keep this issue open to collect feature requests from users and hear your voice. Our monthly release plan is also available here.
You can either:
1. Suggest a new feature by leaving a comment.
…
-
I really like the simplicity of TK and think it could be broadly applicable to kernel authoring beyond attention. Has there been any benchmarking done of pure GEMM operations? If so, an example would …
-
This is continuation of #5080, but with more specific goals.
### Goal
To measure luci-interpreter performance and memory consumption for models delivered with tflite-micro: https://github.com/tens…
-
I am following the int8 tutorial at https://github.com/mlfoundations/open_clip?tab=readme-ov-file#int8-support but I cannot make it work with the latest version of open clip.
Installing the require…
-
Platforms: linux
This test was disabled because it is failing in CI. See [recent examples](https://hud.pytorch.org/flakytest?name=test_retrace_export_while_loop_simple_cpu_float32&suite=TestHOPCPU&li…