mlp-mixer Search Results

357 results
for mlp-mixer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

lucidrains/mlp-mixer-pytorch #15

First of all, thank you for your great creation. Is there a …

First of all, thank you for your great creation. Is there a 3D data version of MLP-Mixer

lxy51 updated 1 month ago
13
kozistr/pytorch_optimizer #100

Updated Shampoo uber slow performance

I just swap out Nero optimizer in my Lightning AI loop and gave the new Shampoo a try. There is something going on with it, as this card is typically able to do 2 it per second on almost anything. Old…

redknightlois updated 1 year ago
10
TylerYep/torchinfo #187

estimate model size is different with nvidia-smi usage

**Describe the bug** estimate model size is different with nvidia-smi usage **To Reproduce** 1. used code, and command line 2. The code will run on the cuda:2 device ``` import torch impor…

YHYeooooong updated 2 years ago
6
CongWeilin/GraphMixer #4

possible data leakage

Hi authors, Thanks for your excellent works. However i have met some troubles in reproducing the results reported in paper. I found that there are two points may cause data leakage: ### 1. data le…

towardsagi updated 11 months ago
5
Oneflow-Inc/CoModels #163

模型适配进度（拓展）

总计：8+14+9+26+21+10+3+6+6+10+3+9+11+4+88=228 领域 | 功能 | 基础模型 | 支持方式 | 负责人 | 状态 | 展开数量|Onelab负责人| OneLab公开项目链接 -- | -- | -- | -- | -- | -- | -- | -- | -- cv | classification | EfficientNet_b0| flowvis…

kokuro-asahi updated 11 months ago
89
MusicPlayerDaemon/MPD #219

Weird volume behavior when replay_gain_handler is specified …

## Issues When replay_gain_hander is specified as "mixer", 1. A value from `mpc volume` varies by itself after playing a song 2. A dB value written in replaygain tag is quite different from that ob…

estshorter updated 6 years ago
4
e4exp/paper_manager_abstract #669

Exploring the Limits of Large Scale Pre-training

- https://arxiv.org/abs/2110.02095 - 2021 近年の大規模機械学習の発展は、データ、モデルサイズ、学習時間を適切にスケールアップすることで、事前学習の改善がほとんどの下流のタスクに有利に移行することを観察することができることを示唆している。本研究では、この現象を系統的に研究し、上流の精度を上げると下流のタスクの性能が飽和することを証明しました。 …

e4exp updated 3 years ago
1
frotms/PaddleOCR2Pytorch #19

tools/infer predict_rec.py

字符识别infer时，出现错误，det 和 cls是正常的 RuntimeError: Error(s) in loading state_dict for BaseModel: Missing key(s) in state_dict: "backbone.blocks.0.mid_se.conv1.weight", "backbone.blocks.0.mid_se.c…

birchmi updated 1 year ago
4
dhkim0225/1day_1paper #98

[69] How do vision transformers work? (AlterNet)

왜 ViT 가 잘 working 할까에 대해 연구한 논문. [paper](https://arxiv.org/abs/2202.06709) 일반적으로 생각하는 MSA 가 좋은 이유 ``` MSA 의 어떤 부분이 모델을 위해 좋을까? ==> long range dependency MSA가 conv 처럼 동작할까? ==> MSA 가 general…

dhkim0225 updated 2 years ago
2
YoheiIwasaki/paper-survey #3

Deep Residual Learning for Image Recognition

# 論文情報 - [paper](https://arxiv.org/pdf/1512.03385.pdf) - [github](https://github.com/KaimingHe/deep-residual-networks)

sato163 updated 3 years ago
1

上一页 1...4 5 6 7 8 9 10...36 下一页

357 results for mlp-mixer

357 results
for mlp-mixer