-
This page is accessible via [roadmap.vllm.ai](https://roadmap.vllm.ai)
### Themes.
As before, we categorized our roadmap into 6 broad themes: broad model support, wide hardware coverage, state of…
-
How can I prune or spars my custom YOLOv5m model ?
What the defference between pruning and sparsity? As I understand pruning just set zeros to some weights, but sparsity destroy some connections b…
-
I am trying to prune with
python main.py \
--model mistralai/Mistral-7B-Instruct-v0.2 \
--prune_method wanda \
--sparsity_ratio 0.5 \
--sparsity_type unstructured \
--save o…
-
Hi, I'm running evaluation.py on MNLI as described in the README, but I'm getting different results compared to what's displayed there. I'm using Google Colab for this, and you can find my notebook he…
-
Is it possible to do semi-structured sparsity for lower inference latency? Thanks!
-
**Describe the bug**
Error running with Mamba: `'HookedMamba' object has no attribute 'W_E'`.
**Code example**
```
cfg = LanguageModelSAERunnerConfig(
model_name="state-spaces/mamba-2.8b"…
-
**Describe the issue**:
When i pruned yolov7 model, i met this error:
“
Traceback (most recent call last):
File "/home/shihuiyu/yolov7-main/prune.py", line 750, in
train(hyp, opt, devi…
-
When fitting a gamlss model with `pvc()`, the `predict` method doesn't work.
Minimal reprex:
```
age
-
If you are submitting a bug report, please fill in the following details and use the tag [bug].
**Describe the bug**
I encountered a RuntimeError during training while using sae_lens. The error ap…
-
**Describe the bug**
This is a minor issue, but I think the quantization configuration in the file `[examples/quantization_24_sparse_w4a16/2:4_w4a16_group-128_recipe.yaml]`(https://github.com/vllm-pr…