-
Recently an issue was raised in the SAFE Dojo repo which ultimately turned out to be the result of an error message being a bit buried.
Link: https://github.com/CompositionalIT/SAFE-Dojo/issues/185
The er…
-
## Motivation
There is significant interest in vLLM supporting encoder/decoder models. Issues #187 and #180, for example, request encoder/decoder model support. As a result, encoder/decoder supp…
-
**Describe the bug**
When attempting to shard a `gemma_2b_en` model across two (consumer-grade) GPUs, I get:
```
ValueError: One of device_put args was given the sharding of NamedSharding(mesh=…
```
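For context, one common way to express this kind of two-GPU sharding is the Keras 3 distribution API on the JAX backend. The sketch below is an assumption about the setup (the reporter's actual script is not shown); the mesh shape, the `token_embedding/embeddings` layout key, and the `gemma_2b_en` preset call are illustrative.

```python
# Sketch only: assumes a recent Keras 3 release on the JAX backend plus keras_hub.
# This is not the reporter's script; names and shapes are illustrative.
import keras
import keras_hub

# Build a 1x2 mesh over the two local GPUs: replicate over "batch", shard over "model".
devices = keras.distribution.list_devices("gpu")  # expects the two GPUs here
mesh = keras.distribution.DeviceMesh(
    shape=(1, 2), axis_names=("batch", "model"), devices=devices
)

# Shard the large embedding table along the "model" axis; unlisted weights stay replicated.
layout_map = keras.distribution.LayoutMap(mesh)
layout_map["token_embedding/embeddings"] = (None, "model")

# Older Keras 3 releases took a positional device_mesh argument here instead.
keras.distribution.set_distribution(
    keras.distribution.ModelParallel(layout_map=layout_map)
)

model = keras_hub.models.GemmaCausalLM.from_preset("gemma_2b_en")
print(model.generate("Hello", max_length=32))
```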
-
Config:
Windows 10 with an RTX 4090
All requirements installed, incl. the flash-attn build
Server:
```
(venv) D:\PythonProjects\hertz-dev>python inference_server.py
Using device: cuda
Loaded tokeniz…
-
Hi @JWFanggit,
This is excellent work, and I am currently using it. I would like to ask how the driver attention map in the dataset was generated in advance. Which open-source model was used?
-
It's trying to load and never completes:
```
Removing download task for Shard(model_id='llama-3.2-1b', start_layer=0, end_layer=15, n_layers=16): True
0%| …
-
### 🐛 Describe the bug
I'm trying to add a micro-benchmark for flex attention, which is implemented as a HOP. I use `torch.utils.flop_counter.FlopCounterMode`, but it doesn't support capturing FLOP f…
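As a point of reference, here is a minimal sketch of the kind of measurement being attempted, assuming a recent PyTorch (2.5+) where `flex_attention` is importable; the shapes are illustrative:

```python
import torch
from torch.nn.attention.flex_attention import flex_attention
from torch.utils.flop_counter import FlopCounterMode

device = "cuda" if torch.cuda.is_available() else "cpu"

# Illustrative shapes: (batch, heads, seq_len, head_dim).
q, k, v = (torch.randn(1, 8, 128, 64, device=device) for _ in range(3))

# FlopCounterMode attributes FLOPs via per-op formulas (matmul, SDPA, ...);
# flex_attention dispatches through a higher-order op, which is what the
# report says is not covered by the counter.
with FlopCounterMode(display=True) as flop_counter:
    flex_attention(q, k, v)

print(flop_counter.get_total_flops())
```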
-
I don't know if this affects anything. When I generate, I get this:
clip missing: ['clip_l.logit_scale', 'clip_l.transformer.text_projection.weight']
Loading 1 new model
C:\Users\heruv\ComfyUI\comfy\ld…
-
### Is your feature request related to a problem? Please describe.
The current implementation causes issues when loading old model checkpoints during inference as it is not clear whether flash attent…
-
It's normal for people to make and release custom builds for projects that don't provide any pre-built binaries, or don't provide pre-built Windows binaries (i.e., only the source code or ELF binaries are provided,…