-
### The model to consider.
https://huggingface.co/dunzhang/stella_en_1.5B_v5
`last_hidden_state = model(**input_data)[0]`
In the model's `__init__`:
`vector_linear = torch.nn.Linear(in_features=model.conf…`
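For context, a minimal sketch of the pooling-plus-projection pattern those two lines come from: mean-pool the masked `last_hidden_state`, then project through `vector_linear`. The `vector_dim` value and the randomly initialized head are placeholders on my side; the released checkpoints ship pretrained `vector_linear` weights for several output sizes, so consult the model card for the real setup.

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "dunzhang/stella_en_1.5B_v5"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModel.from_pretrained(model_name, trust_remote_code=True).eval()

# Placeholder head: the real checkpoint provides pretrained vector_linear
# weights; 1024 is an assumed output dimension.
vector_dim = 1024
vector_linear = torch.nn.Linear(in_features=model.config.hidden_size,
                                out_features=vector_dim)

input_data = tokenizer(["an example sentence"], padding=True,
                       return_tensors="pt")
with torch.no_grad():
    # Index [0] is last_hidden_state: (batch, seq_len, hidden_size).
    last_hidden_state = model(**input_data)[0]
    mask = input_data["attention_mask"].unsqueeze(-1).bool()
    # Mean-pool over non-padding tokens, then project and L2-normalize.
    pooled = last_hidden_state.masked_fill(~mask, 0.0).sum(dim=1) / mask.sum(dim=1)
    vectors = torch.nn.functional.normalize(vector_linear(pooled), dim=-1)
```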
-
Hi. I am doing a project and I would like to get the .mid (MIDI) soundtrack files from Pokemon Red. You don't have such a converter yet; I would be very happy if you made one. Thanks for your attention!
-
I am using [this PyTorch-provided script](https://github.com/pytorch/pytorch/blob/main/benchmarks/transformer/score_mod.py) to benchmark flex attention against eager attention and got the attached results ([defaul…
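For anyone who wants to reproduce a comparable micro-benchmark outside that script, a minimal sketch of calling flex attention with a causal `score_mod` is below. The shapes, dtype, and the `torch.compile` step are my assumptions, not taken from the linked script, and it needs PyTorch 2.5+ with a GPU.

```python
import torch
from torch.nn.attention.flex_attention import flex_attention

# Assumed benchmark shapes: (batch, heads, seq_len, head_dim).
B, H, S, D = 2, 8, 1024, 64
q, k, v = (torch.randn(B, H, S, D, device="cuda", dtype=torch.float16)
           for _ in range(3))

def causal(score, b, h, q_idx, kv_idx):
    # score_mod receives the raw score plus batch/head/query/key indices
    # and returns a modified score; this one applies a causal mask.
    return torch.where(q_idx >= kv_idx, score, -float("inf"))

compiled = torch.compile(flex_attention)  # compiled path vs. eager baseline
out = compiled(q, k, v, score_mod=causal)
```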
-
Could this project help you? https://github.com/philipturner/metal-flash-attention
So far, metal-flash-attention does provide the fastest generation speed for Stable Diffusion on macOS.
-
Hello author, thanks for sharing your work! While reading your paper I noticed the attention-map visualizations, and I would like to ask about the approach, or any reference code, for producing attention maps like these. Thanks for your reply!
![d868f961d8b4fd09e2a70aca4ee9951](https://github.com/user-attachments/assets/a2f578f3-0628-48d9-87ed-6bc6801d694f)
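Not the paper's code, but a common recipe for this kind of figure, sketched here under the assumption of a ViT-style backbone: run the model with `output_attentions=True`, take the CLS-to-patch attention from the last layer, average over heads, reshape it to the patch grid, and overlay it on the input image. The checkpoint name and file paths are placeholders.

```python
import torch
import matplotlib.pyplot as plt
from PIL import Image
from transformers import ViTImageProcessor, ViTModel

# Placeholder backbone; substitute the paper's model if it is released.
processor = ViTImageProcessor.from_pretrained("google/vit-base-patch16-224")
model = ViTModel.from_pretrained("google/vit-base-patch16-224").eval()

image = Image.open("example.jpg").convert("RGB")
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs, output_attentions=True)

# Last layer: (batch, heads, tokens, tokens). Average over heads, take
# the CLS row, drop the CLS column, fold back into the 14x14 patch grid.
attn = outputs.attentions[-1][0].mean(dim=0)
cls_to_patches = attn[0, 1:]
grid = int(cls_to_patches.numel() ** 0.5)
heatmap = cls_to_patches.reshape(grid, grid)

plt.imshow(image)
plt.imshow(heatmap, alpha=0.5, cmap="jet",
           extent=(0, image.width, image.height, 0))
plt.axis("off")
plt.savefig("attention_map.png", bbox_inches="tight")
```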
-
The feature section lacks interactivity and visual appeal, which makes it less engaging for visitors and may hinder the website's ability to capture attention.
Description of the solution I'd like
…
-
# ❓ Questions and Help
Hi all,
Debian 13
Python 3.10.12 (venv)
PyTorch 2.4.1 (ROCm)
When I try to compile xformers against PyTorch 2.4.1 (ROCm), I end up with the common "no file found at /th…
-
Nice paper on making LLMs fully attention-based. However, I noticed that the largest model discussed in the paper is a 1.5B model.
I wonder whether the pattention layer is difficult to tensor-parallelize…
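For concreteness, a rough sketch of a parameter-token attention layer in the spirit of the paper's Pattention: the input attends over learnable key/value parameter tokens, i.e. `output = norm(X @ K_p^T) @ V_p`. Plain scaled softmax is used here as a stand-in for the paper's modified normalization, and the tensor-parallel comment is my own reasoning, not a claim about the paper's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Pattention(nn.Module):
    """Input tokens attend over a pool of learnable parameter tokens."""

    def __init__(self, hidden_size: int, num_param_tokens: int):
        super().__init__()
        self.key_params = nn.Parameter(
            torch.randn(num_param_tokens, hidden_size) * 0.02)
        self.value_params = nn.Parameter(
            torch.randn(num_param_tokens, hidden_size) * 0.02)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, hidden) -> scores: (batch, seq, num_param_tokens)
        scores = x @ self.key_params.t() / x.shape[-1] ** 0.5
        weights = F.softmax(scores, dim=-1)  # stand-in normalization
        return weights @ self.value_params   # (batch, seq, hidden)

# The natural tensor-parallel split is along num_param_tokens: each rank
# holds a shard of key_params/value_params and an all-reduce combines the
# partial outputs, as with row-sharded MLP weights. The catch is that a
# softmax over a sharded dimension needs its max and denominator exchanged
# across ranks, which is presumably where the difficulty would arise.
```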
-
### System Info
```Shell
- `Accelerate` version: 0.34.2
- Platform: Linux-5.4.0-45-generic-x86_64-with-glibc2.31
- `accelerate` bash location: /home/gradevski/miniconda3/envs/summary_explainer_p…
```
-
Hi, I have been paying attention to your project and I think it is very good, but I still don't know what the output is. Could you explain it to me roughly? Can it render Gaussian splats?