mlp-mixer Search Results

357 results
for mlp-mixer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

HazyResearch/fly #1

Minor questions about the paper and code

Thanks a lot for the interesting work! I am really enjoying reading the paper and the code. I actually have two minor questions. It will be really appreciated if any hints can be provided: 1. I n…

os-hxfan updated 9 months ago
1
xmed-lab/DIF-Net #3

Please tell me. I see that your data preprocessing is proces…

Error reported during testing Namespace(name='dif-net', epoch=400, dst_list='knee_cbct', split='test', combine='mlp', num_views=3, view_offset=0, out_res=256, eval_npoint=100000, visualize=False) …

yuanpengpeng updated 9 months ago
1
HazyResearch/m2 #8

MonarchMixerLayer

Hello, I've come across an algorithm in the paper that appears to be designed for the M2 layer, with the intention of replacing both the Attention and MLP layers (specifically the nn.Linear part of…

jeohalves updated 1 year ago
1
pytorch/vision #6602

Add other swin architectures.

### 🚀 The feature The original paper describes a few more configurations based on swin Transformer. 1. Swin Large: Simply a large model of swin transformer, needs a few config tweaks and we can po…

oke-aditya updated 4 months ago
6
state-spaces/mamba #522

Exploding gradients if ngroups is higher than 1.

Whenever I set n groups to value higher than 1, the gradients explode after like 70-100M tokens. I have no idea if it's my config causing problems or is it mamba-2 itself (I'm using huggingface implem…

DanFosing updated 2 months ago
3
mlverse/torch #672

einops

https://github.com/arogozhnikov/einops

kevinykuo updated 3 years ago
2
HazyResearch/H3 #25

TypeError: forward() got an unexpected keyword argument 'las…

Traceback (most recent call last): File "/home/boofboy/Desktop/x/main.py", line 124, in output_ids = model.generate(input_ids=input_ids, max_length=max_length, File "/home/boofboy/minicond…

NewDaddy updated 1 year ago
2
JNaranjo-Alcazar/acoustic_scene_dcase2022 #42

Conv-mixer

https://keras.io/examples/vision/convmixer/

JNaranjo-Alcazar updated 2 years ago
6
e4exp/paper_manager_abstract #607

S2-MLPv2: Improved Spatial-Shift MLP Architecture for Vision

- https://arxiv.org/abs/2108.01072 - 2021 近年、MLPベースのビジョンバックボーンが登場しています。誘導バイアスの少ないMLPベースのビジョンアーキテクチャは、CNNやビジョントランスフォーマーと比較して、画像認識において競争力のある性能を達成しています。その中でも、空間シフトMLP(S2-MLP)は、単純な空間シフト操作を採用しており、M…

e4exp updated 3 years ago
1
1079863482/paddle2torch_PPOCRv3 #2

模型转换失败

小哥，你好：我尝试git下你的源码，转换PPOCRv3官方提供的模型，成功了！但是使用ppocr训练自己的识别模型却转换不成功，报错信息是，缺少很多:如下： ``` Missing key(s) in state_dict: "backbone.conv1.conv.weight", "backbone.conv1.bn.weight", "backbone.conv1.bn.bias",…

ChaoII updated 2 years ago
4

上一页 1...1 2 3 4 5 6 7...36 下一页

357 results for mlp-mixer

357 results
for mlp-mixer