期待下一篇mamba back

yuweihao / MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Apache License 2.0

1.96k stars 33 forks source link

Open ZihaoZheng98 opened 4 months ago

ImcwjHere commented 3 months ago

清华已经发了

yuweihao commented 3 months ago

清华已经发了

Actually, MLLA uses self-distillation to train the model, making the results unfair to compare with other models.