pyramid-vision-transformer Search Results

80 results
for pyramid-vision-transformer


Luffy03/Large-Scale-Medical #12

Is PVT (Pyramid Vision Transformer) supported?

Why do you use SwinUNETR as the backbone rather than PVT? Will PVT be supported?

argman updated 1 week ago
2
arXivTimes/arXivTimes #2029

Pyramid Vision Transformer: A Versatile Backbone for Dense P…

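## In a nutshell: A study that extends the application of Transformers beyond image classification to dense prediction tasks such as object detection and segmentation. It builds the feature pyramid, conventionally built with CNNs, on a Transformer basis, stacking stages that each consist of patch representation => self-attention => fully connected layers. Achieves higher accuracy than CNNs ![image](https…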

icoxfog417 updated 3 years ago
1
sunsmarterjie/iTPN #9

Can I use iTPN to study crowd counting?

I see that some scholars study crowd counting on the basis of PVT (Pyramid Vision Transformer). Can I use iTPN to study crowd counting as well?

whencar updated 1 year ago
1
rishikksh20/LocalViT-pytorch #1

PVT V2+ Locality

Dear @rishikksh20, Thank you for your implementation. I am trying to implement LocalViT in Pyramid Vision Transformer Version 2. Could you give me some hints on how I can achieve this, please? R…

khawar-islam updated 2 years ago
2
xingchenshanyao/NNLearning #35

PVT———Pyramid Vision Transformer2012

Reference source: ``` https://blog.csdn.net/oYeZhou/article/details/114288247 ``` Paper title: [Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions](https://arxiv.org/pdf/210…

xingchenshanyao updated 1 year ago
7
salesforce/LAVIS #329

Use pretrained Q-Former with multiple image resolutions

In the BLIP-2 paper, it is specified that: "[Q-Former] _extracts a fixed number of output features from the image encoder, independent of input image resolution._". However, when using cross-atten…

david-az updated 1 year ago
1
e4exp/paper_manager_abstract #308

Multi-Scale Vision Longformer: A New Vision Transformer for …

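- https://arxiv.org/abs/2103.15358 - 2021 This paper introduces Multi-Scale Vision Longformer, a new Vision Transformer (ViT) architecture that substantially enhances ViT for encoding high-resolution images using two techniques. The first is a multi-scale model structure, which provides image encodings at multiple scales with manageable…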

e4exp updated 3 years ago
2
nobodyplayer1/VM-UNetV2 #3

Is it reasonable to merge the training set from isic17 and i…

**Excuse me, I just saw the setting code in configs/config_setting_v2.py:** elif datasets == 'isic_all': data_path = '/raid/code/mamba_all/VM-UNet/data/zd-medic/isic_all/' **Th…

Frank-Cai0709 updated 4 weeks ago
3
phiphiphi31/SBT #1

Publishing the code

Dear authors, thank you for your inspiring work! The results look pretty promising, and it looks like it could even outperform SwinTrack, which is the SOTA on several benchmarks at the moment. I am reall…

zanilzanzan updated 1 year ago
2
phiphiphi31/DualTFR #3

When will the code of DualTFR be released?

Hi, thank you for the excellent work. Could I ask when the code of DualTFR will be available? Thank you so much!

ZhanYang-nwpu updated 10 months ago
2
