-
[Flash attention 3](https://tridao.me/blog/2024/flash3/) makes use of new features of the Hopper architecture.
- (async) WGMMA
- TMA
- overlapping softmax with GEMMs
Are these all things that can currently (…
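The softmax/GEMM overlap in FlashAttention-3 rests on the online-softmax rescaling trick: the scores can be consumed block by block, rescaling a running sum whenever a new maximum appears, so the softmax never needs all scores at once. A minimal pure-Python sketch of that rescaling (function name and blocking are mine, just for illustration; the real kernels do this per tile in registers):

```python
import math

def online_softmax(scores, block_size=2):
    """Process scores block by block, keeping a running max `m` and a
    running sum `s` of exp(x - m); when a block raises the max, the old
    sum is rescaled by exp(m_old - m_new). This is what lets
    FlashAttention-style kernels interleave softmax with the matmuls."""
    m = float("-inf")  # running max
    s = 0.0            # running sum of exp(x - m)
    for i in range(0, len(scores), block_size):
        block = scores[i:i + block_size]
        m_new = max(m, max(block))
        s = s * math.exp(m - m_new) + sum(math.exp(x - m_new) for x in block)
        m = m_new
    return [math.exp(x - m) / s for x in scores]
```

The result matches a standard two-pass softmax, but each block only needs the running `(m, s)` pair, not the full score vector.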
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussi…
-
### Feature request
After torch-compiling the whisper.text_decoder model, the inference time is impressively low. Thank you for the work!
However, the warm-up time is very long since it needs to go thr…
-
![image](https://github.com/Anddd7/architecture-diagram/assets/24785373/8f4e54b5-fdbb-4c7f-8177-e5772b950c25)
-
Whether I load the local model or the gpt2-imdb model from Hugging Face, the following error is reported:
`
ValueError: GPTModelBranch does not support an attention implementation through t…
-
**Short Description**
I would like to add the architecture described in the paper mentioned below.
**Papers**
A lightweight deep learning model for automatic segmentation and analysis of opht…
-
### Problem Description
Composable Kernel currently supports fused attention (FA2) on RDNA3(+) architectures only in the forward direction. This greatly increases the VRAM requirement…
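A back-of-the-envelope illustration of why a non-fused backward pass inflates VRAM (function and numbers are mine, not CK code): without fusion, the backward pass must materialize the full N×N attention matrix per head, whereas a fused FA2-style backward recomputes it tile by tile.

```python
def attn_matrix_bytes(seq_len, n_heads, batch=1, bytes_per_el=2):
    """VRAM needed just to materialize the full softmax(QK^T) matrix
    that an unfused backward pass has to keep resident."""
    return batch * n_heads * seq_len * seq_len * bytes_per_el

# fp16, 32 heads, 8k context: the attention matrix alone is 4 GiB,
# which a fused backward avoids by recomputing tiles on the fly.
print(attn_matrix_bytes(8192, 32) / 2**30)  # → 4.0
```

The quadratic `seq_len * seq_len` term is the point: at long contexts this single buffer dwarfs the O(N) activations a fused kernel keeps.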
-
-
### Motivation.
As a continuation of #5367: since that merge request was rejected and I have to maintain my own fork to support this scenario, I suggest adding support in vLLM for model architec…
-
1. Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation (2021)
code: no
2. Body Meshes as Points (2021)
regarded as a two-class classification task (if a grid…