-
Hi, thanks for the nice work and great repo!
I changed the config to `train_with_clip=1` to include ClipLoss.
I then get the following error in the eval step:
![image](https://user-images.githu…
-
[Scalable Diffusion Models with Transformers](https://arxiv.org/pdf/2212.09748)
Given the remarkable achievements of Google's AlphaFold 3, which also uses DiT, an architecture combining Diffusion and Transformers…
-
![image](https://github.com/user-attachments/assets/00501d0e-a886-4a15-b2fd-29b09eee99aa)
-
It'd be great to have XLabs ControlNets supported in `diffusers`. We already support their LoRAs.
Code: https://github.com/XLabs-AI/x-flux/
Checkpoint: https://huggingface.co/XLabs-AI/flux-contro…
-
### System Info
```Shell
- `Accelerate` version: 0.33.0
- Platform: Windows-10-10.0.22631-SP0
- `accelerate` bash location: C:\Users\Nech\anaconda3\envs\transformer-multi-device\Scripts\accelera…
-
Models envisioned:
- TCN / LSTM / GRU as sequential models;
- Decided to abandon HLSTM, indie LSTM, and transformer for various reasons;
- Attention schemes:
  - Average pooling
  - Plain self-attent…
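As a rough illustration of the two pooling schemes listed above, here is a minimal NumPy sketch (the weight vector `w` and shapes are assumptions for illustration, not from the original post): average pooling weights every timestep equally, while a plain single-query self-attentive pooling learns a score per timestep and takes a softmax-weighted sum.

```python
import numpy as np

def average_pool(h):
    # h: (T, d) sequence of hidden states -> (d,) by uniform averaging over time.
    return h.mean(axis=0)

def self_attention_pool(h, w):
    # Plain (single-query) self-attentive pooling: score each timestep with a
    # learned vector w of shape (d,), softmax over time, then weighted sum.
    scores = h @ w                   # (T,) raw attention scores
    scores = scores - scores.max()   # subtract max for numerical stability
    alpha = np.exp(scores)
    alpha = alpha / alpha.sum()      # attention weights over timesteps
    return alpha @ h                 # (d,) pooled representation
```

With a zero score vector the attention weights are uniform, so self-attentive pooling degenerates to average pooling, which makes it a strict generalization.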
-
The normalization seems different from the paper "Attention Is All You Need":
in the paper, the normalization layer comes after the MHA and feed-forward layers, while in torchnlp it comes before them.
x…
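The difference described above is the post-norm vs. pre-norm ordering. A minimal NumPy sketch of the two residual-block orderings (the `sublayer` callable stands in for MHA or the feed-forward network; `layer_norm` here is a simplified version without learnable gain/bias):

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # Normalize over the last (feature) dimension.
    mean = x.mean(axis=-1, keepdims=True)
    std = x.std(axis=-1, keepdims=True)
    return (x - mean) / (std + eps)

def post_norm_block(x, sublayer):
    # "Attention Is All You Need" ordering: norm AFTER the residual add.
    return layer_norm(x + sublayer(x))

def pre_norm_block(x, sublayer):
    # Pre-norm variant (as in torchnlp): norm BEFORE the sublayer,
    # with the residual connection outside the norm.
    return x + sublayer(layer_norm(x))
```

Note the practical consequence: post-norm output is always normalized per position, while pre-norm preserves the raw residual stream, which is why the two variants train differently.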
-
I have built my own demo file. After uploading a video, it gives blank output. Could anyone help me out?
-------------------------------Here's the demo file-------------------------
from argpars…
-
Dear Lee,
Awesome job and congratulations!
It seems that there is only the multi-head self-attention edition of SpatialNet here. Will you release the online Mamba edition in the future?
Best!
-
/tmp/tmppngxpwds.obj
Traceback (most recent call last):
File "/home/jkx/anaconda3/envs/InstantMesh/lib/python3.10/site-packages/gradio/queueing.py", line 536, in process_events
response =…