-
Thanks a lot for the interesting work!
I am really enjoying reading the paper and the code.
I have two minor questions and would really appreciate any hints:
1. I n…
-
Error reported during testing
Namespace(name='dif-net', epoch=400, dst_list='knee_cbct', split='test', combine='mlp', num_views=3, view_offset=0, out_res=256, eval_npoint=100000, visualize=False)
…
-
Hello,
I've come across an algorithm in the paper that appears to be designed for the M2 layer, with the intention of replacing both the Attention and MLP layers (specifically the nn.Linear part of…
-
### 🚀 The feature
The original paper describes a few more configurations based on Swin Transformer.
1. Swin Large: simply a larger Swin Transformer model; it needs a few config tweaks and we can po…
-
Whenever I set n groups to a value higher than 1, the gradients explode after about 70-100M tokens. I have no idea whether my config is causing the problem or whether it is Mamba-2 itself (I'm using Hugging Face implem…
-
https://github.com/arogozhnikov/einops
-
Traceback (most recent call last):
File "/home/boofboy/Desktop/x/main.py", line 124, in <module>
output_ids = model.generate(input_ids=input_ids, max_length=max_length,
File "/home/boofboy/minicond…
-
https://keras.io/examples/vision/convmixer/
-
- https://arxiv.org/abs/2108.01072
- 2021
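In recent years, MLP-based vision backbones have emerged.
MLP-based vision architectures, which carry less inductive bias, achieve competitive image-recognition performance compared with CNNs and vision Transformers.
Among them, the spatial-shift MLP (S2-MLP) adopts a simple spatial-shift operation, M…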
e4exp updated
3 years ago
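The spatial-shift operation mentioned in the summary above can be illustrated with a small sketch. It assumes the commonly described S2-MLP scheme, in which the channels are split into four equal groups and each group is shifted by one pixel in a different direction, with border pixels left unchanged. The function name `spatial_shift` and the nested-list `(H, W, C)` layout are hypothetical choices made for this illustration only; real implementations operate on tensors with slicing.

```python
def spatial_shift(x):
    """Shift four channel groups of an (H, W, C) nested list by one pixel each.

    Illustrative sketch only: group 0 reads from the row above, group 1 from
    the row below, group 2 from the column to the left, group 3 from the
    column to the right. Positions with no neighbour keep their own value.
    """
    h, w, c = len(x), len(x[0]), len(x[0][0])
    if c % 4 != 0:
        raise ValueError("channel count must be divisible by 4")
    g = c // 4  # channels per group

    # Start from a deep copy so border pixels keep their original values.
    out = [[list(pixel) for pixel in row] for row in x]

    for i in range(h):
        for j in range(w):
            for k in range(c):
                if k < g:
                    si, sj = i - 1, j      # source is the row above
                elif k < 2 * g:
                    si, sj = i + 1, j      # source is the row below
                elif k < 3 * g:
                    si, sj = i, j - 1      # source is the column to the left
                else:
                    si, sj = i, j + 1      # source is the column to the right
                if 0 <= si < h and 0 <= sj < w:
                    out[i][j][k] = x[si][sj][k]
    return out


# Tiny example: a 3x3 image with 4 channels, values encode (row, col, channel).
image = [[[100 * i + 10 * j + k for k in range(4)] for j in range(3)]
         for i in range(3)]
shifted = spatial_shift(image)
```

The shift itself has no learnable parameters; it only moves information between neighbouring positions so that a following per-position channel-mixing layer can combine it.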
-
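Hi there,
I cloned your source code and successfully converted the official PPOCRv3 model! However, converting a recognition model that I trained myself with ppocr fails. The error message reports many missing keys, as follows: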
```
Missing key(s) in state_dict: "backbone.conv1.conv.weight", "backbone.conv1.bn.weight", "backbone.conv1.bn.bias",…