-
- https://arxiv.org/abs/2103.14030
- 2021
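This paper introduces a new Vision Transformer, called Swin Transformer, that serves as a general-purpose backbone for computer vision.
The challenges in adapting the Transformer from language to vision include the large variation in the scale of visual entities and, compared with words in text, …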
-
https://openreview.net/pdf?id=3F6I-0-57SC
-
Hi @AFeng-x, thanks for sharing the great SMT work!
I'd like to bring up another highly related hierarchical vision transformer that also deals with the scale problems: [MaxViT: Multi-Axis Vision T…
-
Reference source:
```
https://blog.csdn.net/qq_37541097/article/details/121119988
```
Paper title:
[Swin Transformer: Hierarchical Vision Transformer using Shifted Windows](https://arxiv.org/abs/2103.14030)
-
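One more question, about PatchEmbed and SwinTransformer_Layer: I could not find the code for them.
- Does SwinTransformer_Layer include PatchEmbed and SwinTransformerblock?
- I would really appreciate it if you could provide this part of the code.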
-
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
https://arxiv.org/abs/2210.09549
I really wonder if a SwinTransformer based UNet really is that much be…
-
CVPR 2022
#
Format
* **Paper Title**
*Author(s)*
CVPR, 2022. [[Paper]](link) [[Code]](link) [[Website]](link)
To be filled in:
1) Paper Title
2) Author(s)
3) The three "link" placeholders
4) Leave one blank line between two papers
# agent
Meta Ag…
-
# Prerequisites
Please answer the following question for yourself before submitting an issue.
- [x] I checked to make sure that this feature has not been requested already.
## 1. The entire U…
-
I have read this part of the official document:
https://maartengr.github.io/BERTopic/getting_started/hierarchicaltopics/hierarchicaltopics.html#visualizations:~:text=Merge%20topics,-%C2%B6
It is rea…
-
GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation
https://arxiv.org/abs/2410.11841