-
Hi team,
We are currently adapting our training environment to use the fused attention functions. In one of our training setups, we work with batch size one and concatenate multiple documents along …
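Packing several documents into a single batch-one sequence usually requires tracking per-document boundaries so the fused attention kernel does not attend across documents. As a minimal sketch (the function name and the varlen-style offset convention are illustrative assumptions, not our actual training code), the cumulative-offset bookkeeping looks like:

```python
# Sketch: build cumulative sequence-length offsets for documents packed
# along the sequence dimension with batch size one. Varlen-style fused
# attention entry points typically consume offsets of this shape; the
# exact kernel API may differ (assumption, not the actual setup).

def build_cu_seqlens(doc_lengths):
    """Return [0, l0, l0+l1, ...] marking document boundaries
    inside the packed sequence."""
    offsets = [0]
    for length in doc_lengths:
        offsets.append(offsets[-1] + length)
    return offsets

# Three documents of lengths 5, 3, and 8 packed into one sequence of 16.
offsets = build_cu_seqlens([5, 3, 8])
print(offsets)  # [0, 5, 8, 16]
```

Each adjacent pair of offsets then delimits one document's tokens inside the packed sequence.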
-
It is very early, I agree, but this is just to bring the issue to Valve's attention. With the recent update that dropped, an [SDK](https://github.com/ValveSoftware/source-sdk-2013) update might be in order fo…
-
### System Info
```
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 560.35.03              Driver Version: 560.35.03      CUDA Version: 12.6     |
|--------------------…
```
-
### Description
With the `0.4.35` release, the flash attention kernel hits an out-of-memory error during compilation for long-sequence-length inputs. It fails while compiling the reference attention implementation for cos…
-
For some reason, notifications for activity on this plugin were muted. No worries, I've fixed that, and thank you all for your attention!
I will be fixing things as they are reported... many thanks to all the reporters and contrib…
-
I think the feed should show the number of forecasters. This is very useful for interpreting a forecast you see in the feed without having to click into it, and it is also useful for admins to assess which que…
-
```
~/MusePose# accelerate launch train_stage_2.py --config configs/train/stage2.yaml
The following values were not passed to `accelerate launch` and had defaults used instead:
`--num_processes`…
```
-
Had an issue with my thumb drive while using bootqt and wasn't paying attention. Bootqt ended up wiping my entire main drive without so much as a warning prompt that it was attempting to clear the act…
-
I would like to report a potential bug I found, which I previously submitted via email on November 1st. Since I haven’t received a response, I wanted to follow up here in case the email was missed or …
-
### System Info
transformers == 4.45
torch == 2.4.1 + cu118
accelerate == 1.0.1
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own modif…