-
HI, I am trying to train this model but I have some issues when the model run Vicuna model:
RuntimeError: Internal: src/sentencepiece_processor.cc(1101) [model_proto->ParseFromArray(serialized.data()…
-
Thank you for developing trt-llm. It's helping me a lot
I'm trying to use medusa with trt-llm, referencing [this page](https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples/medusa)
It's working f…
-
Thanks for the great work. I was wondering, you used MoE for Mixtral Model. have you actually used it for your model or it was implemented for testing? I see in the scripts you have scripts for llama …
-
### What's the name of your attack?
DSN
### What's the title of the paper where you present your attack?
Don’t Say No: Jailbreaking LLM by Suppressing Refusal
### What's the URL of the paper?
htt…
-
File "/home/lm/OpenFedLLM-main/main_dpo.py", line 109, in
results = trainer.train()
File "/home/lm/yes/envs/opfl/lib/python3.10/site-packages/transformers/trainer.py", line 1539, in train
…
-
I'm trying to use medusa with trt-llm, referencing [this page](https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples/medusa)
It's working fine with vicuna 7B and its medusa heads, as reference in…
-
1. The dowload_xxx.sh files seem not-working(using ubuntu 23.x or 24.x)
2. In vicuna RTL sim, if using built-in downloaded toolchain(version 22.x),
when the below command is issued(as prescribed …
-
### Already reported ? *
- [X] I have searched the existing open and closed issues.
### Regression?
No
### System Info and Version
System/Version info
```sh
Hyprland 0.45.0 built from branch…
-
### Issue Description
Getting the following error:
503 Server Error: Service Temporarily Unavailable for url: https://public-storage.nexa4ai.com/llava-v1.6-vicuna-7b/model-q4_0.gguf
I'm on the …
-