wangf3014 / Adventurer

Causal Image Modeling for Efficient Visual Understanding

Official PyTorch implementation of Adventurer, a family of linear-time image models built on a causal modeling paradigm.

arXiv: https://arxiv.org/pdf/2410.07599

Release

Models

| Model | Input Size (px) | ImageNet-1k Top-1 Acc. (%) | Checkpoint |
|---|---|---|---|
| Adventurer-Tiny | 224 | 78.2 | Adventurer_tiny_patch16_224 |
| Adventurer-Small | 224 | 81.8 | Adventurer_small_patch16_224 |
| Adventurer-Base | 224 | 82.6 | Adventurer_base_patch16_224 |
| Adventurer-Large | 224 | 83.4 | Adventurer_large_patch16_224 |
| Adventurer-Base/P8 | 224 | 83.9 | Adventurer_base_patch8_224 |
| Adventurer-Base | 384 | 84.2 | Adventurer_base_patch16_384 |
| Adventurer-Base | 448 | 84.3 | Adventurer_base_patch16_448 |
| Adventurer-Base/P8 | 448 | 84.8 | Adventurer_base_patch8_448 |
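
Each checkpoint name encodes the patch size and the input resolution, which together determine how many patch tokens the model processes. The sketch below is not part of this repository: it assumes non-overlapping square patches (as in a standard ViT-style patch embedding) and counts only patch tokens, ignoring any extra tokens the model may add. The helper name `num_patch_tokens` is hypothetical.

```python
import re

# Checkpoint names copied from the table above.
CHECKPOINTS = [
    "Adventurer_tiny_patch16_224",
    "Adventurer_small_patch16_224",
    "Adventurer_base_patch16_224",
    "Adventurer_large_patch16_224",
    "Adventurer_base_patch8_224",
    "Adventurer_base_patch16_384",
    "Adventurer_base_patch16_448",
    "Adventurer_base_patch8_448",
]

# Naming pattern inferred from the table: Adventurer_<size>_patch<P>_<resolution>
NAME_PATTERN = re.compile(r"Adventurer_([a-z]+)_patch(\d+)_(\d+)")


def num_patch_tokens(checkpoint_name: str) -> int:
    """Return the patch-token count implied by a checkpoint name.

    Assumption: the image is split into non-overlapping square patches,
    so the count is (resolution / patch_size) squared.
    """
    match = NAME_PATTERN.fullmatch(checkpoint_name)
    if match is None:
        raise ValueError(f"Unrecognized checkpoint name: {checkpoint_name}")
    patch_size = int(match.group(2))
    resolution = int(match.group(3))
    if resolution % patch_size != 0:
        raise ValueError("Resolution must be divisible by the patch size")
    patches_per_side = resolution // patch_size
    return patches_per_side * patches_per_side


if __name__ == "__main__":
    for name in CHECKPOINTS:
        print(f"{name}: {num_patch_tokens(name)} patch tokens")
```

Under these assumptions, halving the patch size or doubling the resolution quadruples the sequence length, which is where a linear-time model is expected to matter most.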

Install

Evaluation

Training

Citation

@article{wang2024causal,
  title={Causal Image Modeling for Efficient Visual Understanding},
  author={Wang, Feng and Yang, Timing and Yu, Yaodong and Ren, Sucheng and Wei, Guoyizhe and Wang, Angtian and Shao, Wei and Zhou, Yuyin and Yuille, Alan and Xie, Cihang},
  journal={arXiv preprint arXiv:2410.07599},
  year={2024}
}