-
### System Info
```Shell
- `Accelerate` version: 0.34.2
- Platform: Linux-5.4.0-45-generic-x86_64-with-glibc2.31
- `accelerate` bash location: /home/gradevski/miniconda3/envs/summary_explainer_p…
```
-
Hello author, thanks for sharing your work! While reading your paper I noticed the attention map visualizations, and I'd like to ask about the approach or any reference code for producing attention maps like these. Thanks for your reply!
![d868f961d8b4fd09e2a70aca4ee9951](https://github.com/user-attachments/assets/a2f578f3-0628-48d9-87ed-6bc6801d694f)
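For reference, attention heatmaps like this are commonly rendered by pulling out an attention weight matrix and plotting it with matplotlib. The sketch below is a generic illustration, not the author's actual code; the tensor shape, head index, and the dummy input are all placeholders.

```python
# Generic attention-map visualization sketch; assumes you can obtain an
# attention tensor of shape (heads, query_len, key_len) from your model.
import matplotlib.pyplot as plt
import torch

def plot_attention(attn: torch.Tensor, head: int = 0, title: str = "Attention map"):
    """Render one head's attention weights as a heatmap."""
    weights = attn[head].detach().cpu().numpy()  # (query_len, key_len)
    fig, ax = plt.subplots(figsize=(5, 5))
    im = ax.imshow(weights, cmap="viridis")  # rows: queries, cols: keys
    ax.set_xlabel("Key position")
    ax.set_ylabel("Query position")
    ax.set_title(title)
    fig.colorbar(im, ax=ax)
    plt.show()

# Example with random weights standing in for real model attention:
dummy_attn = torch.softmax(torch.randn(8, 16, 16), dim=-1)  # (heads, q, k)
plot_attention(dummy_attn, head=0)
```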
-
# ❓ Questions and Help
Hi all,
Debian 13
Python 3.10.12 (venv)
PyTorch 2.4.1 (ROCm)
When I try to compile xformers against PyTorch 2.4.1 (ROCm), I end up with the common "no file found at /th…
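As a first diagnostic (a sanity check, not a known fix for this particular failure), it can help to confirm that the PyTorch build the compiler sees matches the ROCm wheel you installed:

```python
# Sanity-check the installed PyTorch before building xformers from source;
# a version or path mismatch here often surfaces as missing-file build errors.
import torch

print(torch.__version__)              # expect something like 2.4.1+rocm6.x
print(torch.version.hip)              # HIP/ROCm version baked into the wheel (None on CUDA builds)
print(torch.utils.cmake_prefix_path)  # path the build system uses to locate Torch
```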
-
Could this project be of help to you? https://github.com/philipturner/metal-flash-attention
So far, metal-flash-attention has indeed provided the fastest generation speed for Stable Diffusion on macOS.
-
### Description
I am trying to fine-tune Gemma 2 on TPU and got the following error:
```
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/jax/_src/compiler.py", l…
```
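Since the traceback is cut off, one low-risk first step (just a sanity check, not a fix for this specific compile error) is to confirm that JAX actually sees the TPU devices, since mismatched jax/jaxlib/libtpu versions are a common source of low-level compilation failures:

```python
# Confirm the JAX runtime and visible TPU devices before digging into the
# compiler traceback itself.
import jax

print(jax.__version__)
print(jax.devices())  # should list TpuDevice entries on a healthy TPU VM
```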
-
Dear quanto folks,
I implemented quantization as suggested in your example script [quantize_sst2_model.py](https://github.com/huggingface/optimum-quanto/blob/main/examples/nlp/text-classification/s…
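For context, the core flow of that example boils down to optimum-quanto's `quantize`/`freeze` calls. Here is a minimal sketch of that pattern (the checkpoint name is a placeholder, and the full example also calibrates quantized activations, which is omitted here):

```python
# Minimal optimum-quanto weight-quantization sketch following the pattern of
# the referenced SST-2 example; the checkpoint below is a placeholder.
from transformers import AutoModelForSequenceClassification
from optimum.quanto import quantize, freeze, qint8

model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased-finetuned-sst-2-english"
)

quantize(model, weights=qint8)  # wrap Linear layers with fake-quantized weights
freeze(model)                   # materialize the actual int8 weights
```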
-
TRT-LLM version: v0.11.0
I'm deploying a BART model with Medusa heads, and I noticed this issue https://github.com/NVIDIA/TensorRT-LLM/issues/1946, so I adapted my model with the following steps:
```
1…
```
-
I'm aware that you plan to add compatibility gradually, but I just wanted to bring [Arts and Crafts](https://modrinth.com/mod/artsandcrafts) to your attention as a candidate for eventual support.
-
### 🚀 The feature, motivation and pitch
Flash Attention 3 (https://github.com/Dao-AILab/flash-attention) has been in beta for some time. I tested it on H100 GPUs with CUDA 12.3 and also attempted a…
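For anyone evaluating it, the library exposes a FlashAttention-style functional interface; below is a minimal usage sketch assuming the familiar `flash_attn_func` signature from FlashAttention-2 (the FA3 beta ships a similar function, but its import path and signature should be checked against the repo):

```python
# Minimal FlashAttention usage sketch (FA2-style interface); requires an
# fp16/bf16-capable CUDA GPU and the flash-attn package.
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 1024, 16, 64
q = torch.randn(batch, seqlen, nheads, headdim, device="cuda", dtype=torch.bfloat16)
k = torch.randn_like(q)
v = torch.randn_like(q)

out = flash_attn_func(q, k, v, causal=True)  # (batch, seqlen, nheads, headdim)
print(out.shape)
```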