-
I'm trying to adapt the Hybrid FastConformer for streaming and found something strange: in the config that you provide on Hugging Face, it is indicated that `self_attention_model: rel_pos`. However, in th…
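For reference, a minimal sketch (not an official NeMo recipe) of how to check which attention variant a loaded checkpoint actually reports; the model name below is a placeholder, substitute the Hugging Face checkpoint in question:
```
# Minimal sketch: inspect the encoder config of a pretrained Hybrid FastConformer.
# The model name is a placeholder, not necessarily the checkpoint being discussed.
import nemo.collections.asr as nemo_asr

model = nemo_asr.models.ASRModel.from_pretrained(
    "nvidia/stt_en_fastconformer_hybrid_large_streaming_multi"  # placeholder name
)
# The encoder section of the config records the attention flavour the
# checkpoint was trained with, e.g. "rel_pos".
print(model.cfg.encoder.self_attention_model)
```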
-
Hi, I am using an LLM as part of a multimodal model, so the model needs to pass an `input embedding tensor` directly to `generate`, and also needs to access the language model's `embed_tokens` member to first c…
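For context, a minimal sketch (assuming a standard Hugging Face causal LM; the checkpoint name is a placeholder) of the two pieces being asked about: reading the embedding table via `get_input_embeddings()` and calling `generate()` with `inputs_embeds` instead of `input_ids`:
```
# Minimal sketch: embed the text yourself, then generate from embeddings.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "meta-llama/Llama-2-7b-hf"  # placeholder checkpoint
tok = AutoTokenizer.from_pretrained(name)
lm = AutoModelForCausalLM.from_pretrained(name)

embed_tokens = lm.get_input_embeddings()                    # the embed_tokens module
ids = tok("Describe the image:", return_tensors="pt").input_ids
text_embeds = embed_tokens(ids)                             # (1, seq_len, hidden)

# In a multimodal model, projected vision features would be concatenated here,
# e.g. torch.cat([vision_embeds, text_embeds], dim=1).
out = lm.generate(inputs_embeds=text_embeds, max_new_tokens=32)
print(tok.decode(out[0], skip_special_tokens=True))
```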
-
Loading checkpoint shards: 0%| | 0/5 [00:00
-
# 🐛 Bug
```
from vllm import LLM, SamplingParams

llm = LLM(model=model_dir, enforce_eager=True)
```
then
```
File d:\my\env\python3.10.10\lib\site-packages\xformers\ops\fmha\_triton\splitk_kernels.…
-
**When I run these two, I get this error: RuntimeError: The shape of the 2D attn_mask is torch.Size([77, 77]), but should be (1, 1). Specific errors are as follows:**
F:\Miniconda\envs\dream\lib\site…
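For reference, an illustrative sketch (not the poster's code) of the shape rule behind this message: `nn.MultiheadAttention` only accepts a 2D `attn_mask` of shape `(query_len, key_len)`, so a 77x77 mask (77 is the CLIP text context length) is rejected as soon as the query and key passed in have length 1:
```
# Illustrative only: a 2D attn_mask must match (query_len, key_len).
import torch
import torch.nn as nn

attn = nn.MultiheadAttention(embed_dim=512, num_heads=8, batch_first=True)
x = torch.randn(1, 77, 512)                           # (batch, seq, dim)
mask = torch.full((77, 77), float("-inf")).triu(1)    # causal mask for 77 tokens

attn(x, x, x, attn_mask=mask)                         # OK: mask is (77, 77)
# attn(x[:, :1], x[:, :1], x[:, :1], attn_mask=mask)  # fails: mask should be (1, 1)
```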
-
RuntimeError Traceback (most recent call last)
Cell In[5], line 10
5 prompt = "On Christmas evening, on a crowded sidewalk, this item sits on the road, covered in …
-
I want to run BundleSDF on custom data of high resolution (e.g. 720x1080). If I follow [this instruction](https://github.com/NVlabs/BundleSDF?tab=readme-ov-file#run-on-your-custom-data), I face the fo…
-
With the decoupling of encoders and decoders, we have added a `Linear` encoder, which seems to just embed the inputs and pass them along. We should also add a `SelfAttention` encoder, which encodes th…
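For discussion, a rough sketch of what a `SelfAttention` encoder could look like next to the existing `Linear` one; the class and argument names are illustrative, not the project's actual API:
```
# Rough sketch: embed the inputs (as the Linear encoder does), then let each
# position attend to the others before passing the sequence along.
import torch
import torch.nn as nn

class SelfAttentionEncoder(nn.Module):
    def __init__(self, in_dim: int, embed_dim: int, num_heads: int = 4):
        super().__init__()
        self.embed = nn.Linear(in_dim, embed_dim)
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(embed_dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, in_dim)
        h = self.embed(x)
        attn_out, _ = self.attn(h, h, h)
        return self.norm(h + attn_out)    # residual + norm
```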
-
I tried to load LoRA training adapters from a DeepSpeed checkpoint:
dir:
```
ls Bunny/checkpoints-llama3-8b/bunny-lora-llama3-8b-attempt2/checkpoint-6000
total 696M
-rw-r--r-- 1 schwan46494@gmail.c…
-
Thank you to the authors for the excellent work on interpretability. I am currently doing some research with llama-3.1-8b. After adding support code for `meta-llama/Llama-3.1-8B-Instruct` to transformer_lens, I ran `knowledge_eap.ipynb` and found that in cell 6, when computing `attribute(model, g, data, partial(logit_diff, loss=True, me…
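For reference, a minimal sanity check (assuming the added support code registers the model name with transformer_lens) to confirm the model loads and a forward/backward pass runs before the notebook's attribution step:
```
# Minimal sanity check before running the attribution cell.
import torch
from transformer_lens import HookedTransformer

model = HookedTransformer.from_pretrained("meta-llama/Llama-3.1-8B-Instruct")
tokens = model.to_tokens("The capital of France is")
logits = model(tokens)
loss = logits[0, -1].logsumexp(-1)   # any scalar, just to exercise backward
loss.backward()
```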