linear-attention-model Search Results

1000+ results
for linear-attention-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

google/prompt-to-prompt #46

learned linear projections l_Q, l_K, l_V

Hi, this question is about the linear projections l_Q, l_K, l_V of the attention module in the paper Prompt-to-Prompt. The paper illustrated that the linear projections are learnable. However, in the …

cuixing61 updated 1 year ago
1
ggerganov/llama.cpp #9838

Bug: Llama.cpp with cuda support outputs garbage response wh…

### What happened? ``` You are a helpful assistant > what is 2+2+2+2 44444444444444444444444444444444444444444444444444444444444444444444444444444444444444444 > ``` When I run llama-cli with…

bmahabirbu updated 4 hours ago
7
da03/Attention-OCR #34

model_with_buckets fails because of wildcard input size

Hi I'm trying to understand the run the training code, but I keep running into the issue on line 998 in `seq2seq.py`. As far as I can tell, it's because the encoder_inputs_tensor shape is (?, ?, 512) …

joytafty updated 7 years ago
1
mattia93/GRNet #2

model code

Could you kindly provide the code for training models, please.

Maverick172 updated 2 months ago
1
ZhendongWang6/DIRE #30

It seams that the DIRE tensor save format: jpg or png, deter…

my computh_dir.sh is ``` ## set MODEL_PATH, num_samples, has_subfolder, images_dir, recons_dir, dire_dir export CUDA_VISIBLE_DEVICES=0 export NCCL_P2P_DISABLE=1 MODEL_PATH="../models/256x256_diff…

JYccode updated 3 weeks ago
5
vllm-project/llm-compressor #105

Yaml parsing fails with a custom mapping provided to SmoothQ…

Using released llmcompressor 0.1.0 on python 3.11 on ubuntu 20.04 Phi3Small Instruct does not have the default weights in the mapping (q_proj, k_proj, v_proj), so I supplied my own and it failed wi…

aatkinson updated 5 days ago
6
ggerganov/llama.cpp #9587

Bug: passing `tfs_z` crashes the server

### What happened? If you pass `tfs_z` param to the server, it crashes sometimes. Starting the server: ``` ~/test/llama.cpp/llama-server -m /opt/models/text/gemma-2-27b-it-Q8_0.gguf --verbose `…

z80maniac updated 2 weeks ago
2
axolotl-ai-cloud/axolotl #1706

Zero loss and nan grad_norm when Flash Attention is enabled

### Please check that this issue hasn't been reported before. - [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) didn't find any similar reports…

fgdfgfthgr-fox updated 2 weeks ago
2
NVIDIA/TensorRT-LLM #1093

chatGLM3-6B Build TensorRT engine(s) error

### System Info Traceback (most recent call last): File "/home/powerop/.conda/envs/bamboo…

wohushihaoren updated 4 months ago
6
ollama/ollama #6946

llama runner process has terminated: exit status 0xc0000005

### What is the issue? It's again the https://github.com/ollama/ollama/issues/6011 issue. **The issue is with embedding call with the model converted using convert_hf_to_gguf.py.** litellm.ll…

viosay updated 2 weeks ago
4

上一页 1...19 20 21 22 23 24 25...100 下一页

1000+ results for linear-attention-model

1000+ results
for linear-attention-model