-
Hi, this question is about the linear projections l_Q, l_K, l_V of the attention module in the Prompt-to-Prompt paper. The paper states that the linear projections are learnable. However, in the …
-
### What happened?
```
You are a helpful assistant
> what is 2+2+2+2
44444444444444444444444444444444444444444444444444444444444444444444444444444444444444444
>
```
When I run llama-cli with…
-
Hi, I'm trying to understand and run the training code, but I keep running into an issue on line 998 of `seq2seq.py`. As far as I can tell, it's because the `encoder_inputs_tensor` shape is (?, ?, 512) …
-
Could you please provide the code for training the models?
-
My `computh_dir.sh` is:
```
## set MODEL_PATH, num_samples, has_subfolder, images_dir, recons_dir, dire_dir
export CUDA_VISIBLE_DEVICES=0
export NCCL_P2P_DISABLE=1
MODEL_PATH="../models/256x256_diff…
-
Using the released llmcompressor 0.1.0 with Python 3.11 on Ubuntu 20.04.
Phi3Small Instruct does not have the default weights in the mapping (q_proj, k_proj, v_proj), so I supplied my own and it failed wi…
-
### What happened?
If you pass the `tfs_z` parameter to the server, it sometimes crashes.
Starting the server:
```
~/test/llama.cpp/llama-server -m /opt/models/text/gemma-2-27b-it-Q8_0.gguf --verbose
`…
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) and didn't find any similar reports…
-
### System Info
Traceback (most recent call last):
File "/home/powerop/.conda/envs/bamboo…
-
### What is the issue?
This is the https://github.com/ollama/ollama/issues/6011 issue again.
**The issue occurs on embedding calls with a model converted using convert_hf_to_gguf.py.**
litellm.ll…