-
Great work!
I hope to see the full code of this project released soon.
I have a question about the Refinement Transformer.
You mentioned that BAMM uses an RVQ architecture and also uses a Refinement Transformer for g…
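To make the question concrete, here is a minimal, generic sketch of residual vector quantization (RVQ); it is not BAMM's implementation (the official code is not released yet), and all names (`SimpleRVQ`, `num_quantizers`, `codebook_size`, `dim`) are placeholders of my own:

```python
# Generic residual vector quantization sketch, NOT BAMM's implementation.
# All names (SimpleRVQ, num_quantizers, codebook_size, dim) are placeholders.
import torch
import torch.nn as nn

class SimpleRVQ(nn.Module):
    def __init__(self, num_quantizers=6, codebook_size=512, dim=256):
        super().__init__()
        self.codebooks = nn.ParameterList(
            [nn.Parameter(torch.randn(codebook_size, dim)) for _ in range(num_quantizers)]
        )

    def forward(self, x):
        # x: (batch, seq, dim); each stage quantizes the residual left by the previous stage
        residual, quantized, indices = x, torch.zeros_like(x), []
        for codebook in self.codebooks:
            flat = residual.reshape(-1, residual.size(-1))     # (batch*seq, dim)
            idx = torch.cdist(flat, codebook).argmin(dim=-1)   # nearest code id per token
            codes = codebook[idx].view_as(residual)            # (batch, seq, dim)
            quantized = quantized + codes
            residual = residual - codes
            indices.append(idx.view(x.shape[:-1]))
        # indices[0] is the coarse (base) layer; the rest are residual/refinement layers
        return quantized, torch.stack(indices)
```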
-
Is there a strict requirement for GPUs that support flash_attention? I tried to test on a V100, but this GPU does not support flash_attention, which results in a RuntimeError: No available …
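For anyone hitting this on older GPUs, a minimal sketch of a workaround, assuming the model is loaded through transformers (the model id below is a placeholder): flash-attn kernels require Ampere (sm80) or newer, and a V100 (Volta) reports sm70, so the code falls back to PyTorch's SDPA there.

```python
# Hedged sketch: pick the attention backend from the GPU's compute capability.
# flash-attn kernels require Ampere (sm80) or newer; a V100 (Volta) reports sm70.
import torch
from transformers import AutoModelForCausalLM

major, minor = torch.cuda.get_device_capability()
attn_impl = "flash_attention_2" if major >= 8 else "sdpa"   # SDPA works on V100

model = AutoModelForCausalLM.from_pretrained(
    "your-model-id",                    # placeholder, not a real checkpoint
    torch_dtype=torch.float16,
    attn_implementation=attn_impl,
)
```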
-
Following a "tech architecture" presentation with @maxime-siret @AntoineAugusti and @Brewennn this morning, where we covered the architecture but also the stakes behind it, I am creating this ticket about the possibilities …
-
### 🚀 The feature, motivation and pitch
Flash Attention 3 (https://github.com/Dao-AILab/flash-attention) has been in beta for some time. I tested it on H100 GPUs with CUDA 12.3 and also attempted a…
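For reference, a minimal way to exercise the kernel on such a machine is to call it directly. The snippet below uses the stable flash-attn 2.x Python API; the FA3 beta exposes a similar `flash_attn_func` but ships as a separate package, so the import is an assumption about which wheel is installed.

```python
# Direct kernel smoke test using the flash-attn 2.x API; the FA3 beta is packaged
# separately, so this import is an assumption about the installed wheel.
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 2048, 16, 64
q = torch.randn(batch, seqlen, nheads, headdim, dtype=torch.bfloat16, device="cuda")
k, v = torch.randn_like(q), torch.randn_like(q)

out = flash_attn_func(q, k, v, causal=True)   # (batch, seqlen, nheads, headdim)
print(out.shape)
```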
-
### Feature request
I want to add the ability to use GGUF BERT models in transformers.
Currently, the library does not support this architecture. When I try to load such a model, I get a TypeError: Ar…
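For context, this is roughly how GGUF loading already works for the architectures that are supported; the repo and file names below are placeholders, not real checkpoints. The request is to make the same path work for BERT.

```python
# Sketch of the existing GGUF loading path in transformers for supported
# architectures; repo_id and gguf_file are placeholders, not real artifacts.
from transformers import AutoTokenizer, AutoModelForCausalLM

repo_id = "some-org/some-model-GGUF"     # placeholder GGUF repository
gguf_file = "model.Q4_K_M.gguf"          # placeholder quantized file

tokenizer = AutoTokenizer.from_pretrained(repo_id, gguf_file=gguf_file)
model = AutoModelForCausalLM.from_pretrained(repo_id, gguf_file=gguf_file)
# The GGUF tensors are dequantized into regular PyTorch weights on load.
```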
-
## Overview
The focus for this code review will be centered around GraphExamples.jsx and ImageCarousel.jsx.
Please pay attention to:
* JavaScript issues
* React components
## Review Br…
-
# URL
- https://arxiv.org/abs/2406.15786
# Affiliations
- Shwai He, N/A
- Guoheng Sun, N/A
- Zheyu Shen, N/A
- Ang Li, N/A
# Abstract
- While scaling Transformer-based large language models …
-
### Feature request
Flash Attention 2 is a library that provides attention operation kernels for faster and more memory-efficient inference and training: https://github.com/Dao-AILab/flash-attentio…
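To illustrate the memory-efficiency point, here is a sketch of the packed (variable-length) attention call the library exposes, which avoids padding entirely; the per-sequence lengths below are made up for the example.

```python
# Sketch of flash-attn's packed/variable-length path; per-sequence lengths here
# are arbitrary example values.
import torch
from flash_attn import flash_attn_varlen_func

nheads, headdim = 8, 64
seqlens = [5, 9, 3]                      # three sequences packed without padding
total = sum(seqlens)

q = torch.randn(total, nheads, headdim, dtype=torch.float16, device="cuda")
k, v = torch.randn_like(q), torch.randn_like(q)

# cumulative sequence lengths, int32, shape (batch + 1,)
cu_seqlens = torch.tensor([0, 5, 14, 17], dtype=torch.int32, device="cuda")

out = flash_attn_varlen_func(
    q, k, v,
    cu_seqlens_q=cu_seqlens, cu_seqlens_k=cu_seqlens,
    max_seqlen_q=max(seqlens), max_seqlen_k=max(seqlens),
    causal=True,
)
```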
-
## Overview
The focus for this code review will be centered around the auditedBalanceCollection.js and AuditedBalanceSchemaInput.jsx.
Please pay attention to:
* JavaScript issues
* React …
-
# ComfyUI Error Report
## Error Details
- **Node Type:** ApplyPulidFlux
- **Exception Type:** NotImplementedError
- **Exception Message:** No operator found for `memory_efficient_attention_forwa…
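A quick way to reproduce the failure outside ComfyUI, assuming the error originates in xformers' `memory_efficient_attention` (tensor shapes below are arbitrary placeholders):

```python
# Standalone check for the xformers operator the node relies on; tensor shapes
# are arbitrary placeholders. `python -m xformers.info` lists available kernels.
import torch
import xformers.ops as xops

q = torch.randn(1, 128, 8, 64, dtype=torch.float16, device="cuda")
k, v = torch.randn_like(q), torch.randn_like(q)

try:
    out = xops.memory_efficient_attention(q, k, v)
    print("memory_efficient_attention OK:", out.shape)
except NotImplementedError as err:
    # Usually means the installed xformers wheel has no CUDA kernel for this
    # GPU / dtype combination.
    print("no kernel available:", err)
```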