issues
search
long8v
/
PTIR
Paper Today I Read
19
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[73] Simple Open-Vocabulary Object Detection with Vision Transformers
#81
long8v
opened
1 year ago
0
[72] Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity
#80
long8v
opened
1 year ago
0
[71] Large Models are Parsimonious Learners: Activation Sparsity in Trained Transformers
#79
long8v
opened
1 year ago
0
feat: review relationformer
#78
long8v
closed
1 year ago
0
[70] SSD: Single Shot MultiBox Detector
#77
long8v
opened
1 year ago
0
[69] End-to-End Object Detection with Transformers
#76
long8v
opened
1 year ago
0
[68] Iterative Scene Graph Generation
#75
long8v
opened
1 year ago
0
huggingface DeformableDetr code reading
#74
long8v
opened
1 year ago
0
[67] Deformable DETR: Deformable Transformers for End-to-End Object Detection
#73
long8v
opened
1 year ago
0
[66] Pointly-Supervised Instance Segmentation
#72
long8v
opened
1 year ago
2
[65] Margin Calibration for Long-Tailed Visual Recognition
#71
long8v
opened
1 year ago
0
[64] Open-Vocabulary DETR with Conditional Matching
#70
long8v
opened
1 year ago
0
[63] Masked Autoencoders Are Scalable Vision Learners
#69
long8v
opened
1 year ago
0
[62] What to Hide from Your Students: Attention-Guided Masked Image Modeling
#68
long8v
opened
1 year ago
0
[61] Generative Modeling by Estimating Gradients of the Data Distribution
#67
long8v
opened
1 year ago
0
[60] Efficient Sparsely Activated Transformers
#66
long8v
opened
1 year ago
0
[59] MLP-Mixer: An all-MLP Architecture for Vision
#65
long8v
opened
1 year ago
0
[58] MetaFormer Is Actually What You Need for Vision
#64
long8v
opened
1 year ago
0
[57] Learning Transferable Architectures for Scalable Image Recognition
#63
long8v
opened
1 year ago
0
[56] NICE: Non-linear Independent Components Estimation
#62
long8v
opened
1 year ago
0
[55] Position Prediction as an Effective Pretraining Strategy
#61
long8v
opened
1 year ago
0
[54] Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models
#60
long8v
opened
1 year ago
0
[53] InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
#59
long8v
opened
1 year ago
0
[52] Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
#58
long8v
opened
1 year ago
0
[51] Structured Sparse R-CNN for Direct Scene Graph Generation
#57
long8v
opened
1 year ago
0
[50] Generative Adversarial Networks
#56
long8v
opened
1 year ago
0
[49] Sparse Graph Attention Networks
#55
long8v
opened
1 year ago
0
[48] SAC: Accelerating and Structuring Self-Attention via Sparse Adaptive Connection
#54
long8v
opened
1 year ago
0
[47] Recovering the Unbiased Scene Graphs from the Biased Ones
#53
long8v
opened
1 year ago
0
[46] ReFormer: The Relational Transformer for Image Captioning
#52
long8v
opened
1 year ago
0
[45] BGT-Net: Bidirectional GRU Transformer Network for Scene Graph Generation
#51
long8v
opened
1 year ago
0
[44] Context-Aware Scene Graph Generation With Seq2Seq Transformers
#50
long8v
opened
1 year ago
0
[43] Relation Transformer Network
#49
long8v
opened
1 year ago
0
[42] DETRs with Hybrid Matching
#48
long8v
opened
1 year ago
0
[41] Panoptic Scene Graph Generation
#47
long8v
opened
1 year ago
0
[40] Neural Discrete Representation Learning
#46
long8v
opened
1 year ago
0
[39] Auto-Encoding Variational Bayes
#45
long8v
opened
1 year ago
0
[38] Visual Relationship Detection Using Part-and-Sum Transformers with Composite Queries
#44
long8v
opened
1 year ago
0
RelTR code reading
#43
long8v
opened
1 year ago
0
[37] Relationformer: A Unified Framework for Image-to-Graph Generation
#42
long8v
opened
1 year ago
1
[36] SGTR: End-to-end Scene Graph Generation with Transformer
#41
long8v
opened
1 year ago
0
[35] RelTR: Relation Transformer for Scene Graph Generation
#40
long8v
opened
1 year ago
0
[34] What Regularized Auto-Encoders Learn from the Data Generating Distribution
#39
long8v
opened
1 year ago
0
[33] Learning to Prompt for Continual Learning
#38
long8v
opened
1 year ago
0
[32] ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
#37
long8v
opened
2 years ago
0
[31] GIT: A Generative Image-to-text Transformer for Vision and Language
#36
long8v
opened
2 years ago
0
[30] CoCa: Contrastive Captioners are Image-Text Foundation Models
#35
long8v
opened
2 years ago
2
[29] Grounded Language-Image Pre-training
#34
long8v
opened
2 years ago
0
[28] Learning to Compare: Relation Network for Few-Shot Learning
#33
long8v
opened
2 years ago
0
[27] Cross-Domain Few-Shot Classification via Learned Feature-Wise Transformation
#32
long8v
opened
2 years ago
0
Previous
Next