-
### 🚀 The feature, motivation and pitch
I am working on adjusting radix attention now. Thank you for your support of radix attention. Currently, caching for A that allows for more efficien…
-
Is there any way to add Flash Attention 2 support for this model? If there is a way to do it, I would love to get involved and help out!
I've tried implementing it by looking at [MusicGen's one](https://git…
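For context, my rough understanding is that once a model class in transformers supports FlashAttention 2, it is usually switched on via the `attn_implementation` flag. A minimal sketch (the checkpoint name is a placeholder, not this model):
```python
# Illustrative only: enabling FA2 in transformers for a model that supports it.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "some-org/some-model",                    # placeholder checkpoint, not this model
    torch_dtype=torch.float16,                # FA2 requires fp16 or bf16
    attn_implementation="flash_attention_2",  # raises if the model class lacks FA2 support
)
```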
-
**Describe the bug**
When I use flash-attn 2.0.4, running NeMo results in the error `NameError: name 'flash_attn_with_kvcache' is not defined`.
After checking the [code](https://github.com/NVIDIA…
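For reference, `flash_attn_with_kvcache` only exists in newer flash-attn releases, so on 2.0.4 the guarded import fails and the name is never defined. A quick check I used to confirm this (just a diagnostic sketch, nothing NeMo-specific):
```python
# Confirm whether the installed flash-attn exposes flash_attn_with_kvcache
# (added in later 2.x releases; 2.0.4 does not have it).
import flash_attn

print("flash-attn version:", flash_attn.__version__)
try:
    from flash_attn import flash_attn_with_kvcache
    print("flash_attn_with_kvcache is available")
except ImportError:
    print("flash_attn_with_kvcache is missing; upgrading flash-attn should resolve the NameError")
```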
-
The 2 AMD GPU cards should be at the NERC; attention @hakasapl.
Please arrange for them to be installed (techsquare?) and made available under ESI.
The price to charge will be addressed in https://github.com…
-
I want this trainer class to be implemented with unsloth. How can I do that?
```python
class CustomTrainier(Trainer):
    def __init__(self, model, args, train_dataset, eval_dataset, tokenizer, **kwargs)…
-
We should redesign the navbar_alerts banners (`web/templates/navbar_alerts`).
Designs [in Figma](https://www.figma.com/design/msWyAJ8cnMHgOMPxi7BUvA/Zulip-Web-UI-kit?node-id=563-2713&t=ZDGbub…
-
When I read the code in your nice_stand.py file, I didn't see you using self-attention or graph attention mechanisms, but you describe this part in your paper.
![Image 1](https://github.com/eeyhsong/NICE-…
-
Hi,
Is there a specific reason why FA V2 is being used during the prefill phase but not during the generation phase? Is it due to the fact that Flash Attention does not give any significant performance y…
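My rough understanding, sketched below (not this project's code; function names assume flash-attn >= 2.2), is that prefill attends over the whole prompt at once, where FA2 pays off, while decode has a query length of 1 per step and mostly streams the KV cache, so a KV-cache-aware kernel is often used instead:
```python
# Illustrative sketch of why FA2 helps most in prefill.
import torch
from flash_attn import flash_attn_func, flash_attn_with_kvcache  # flash-attn >= 2.2

def prefill(q, k, v):
    # q, k, v: [batch, prompt_len, nheads, headdim], fp16/bf16 on CUDA.
    # The whole prompt attends at once, so the kernel has lots of work per query
    # and FA2's tiling / fused softmax gives a clear speedup.
    return flash_attn_func(q, k, v, causal=True)

def decode_step(q_new, k_cache, v_cache, cache_seqlens):
    # q_new: [batch, 1, nheads, headdim] -- one new token per step.
    # Attention here is memory-bound (reading the KV cache), so engines often
    # use a dedicated KV-cache kernel rather than the generic FA2 path.
    return flash_attn_with_kvcache(
        q_new, k_cache, v_cache, cache_seqlens=cache_seqlens, causal=True
    )
```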
-
Will my training yield better results over time? So far, training has taken about 9 hours.
I have 1500 wav samples, with a total audio length of approximately 2 hours.
![Screenshot 2024-11-08 at…
-
### Model description
An updated OLMo model will be released in November. The new model has a few small architecture changes compared to the existing model in transformers:
- RMSNorm is used inste…
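
For context on the RMSNorm change above, a minimal RMSNorm sketch (illustrative only, not necessarily the exact OLMo 2 module):
```python
import torch
import torch.nn as nn

class RMSNorm(nn.Module):
    # Minimal RMSNorm for illustration: scale by the root-mean-square of the
    # features with a learned gain; no mean subtraction and no bias.
    def __init__(self, dim: int, eps: float = 1e-6):
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        rms = torch.rsqrt(x.pow(2).mean(dim=-1, keepdim=True) + self.eps)
        return self.weight * (x * rms)
```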