DPT: State of the art Semantic-segmentation and Monocular depth estimation network (link to another project)

pjreddie / darknet

Convolutional Neural Networks

Other

25.87k stars 21.33k forks source link

Link to another project: DPT (Dense Prediction Transformers) - State of the art Semantic-segmentation and Monocular depth estimation network

Top-1 accuracy on Pascal-Context Semantic segmentations dataset, and NYU Depth v2 mono-depth dataset, by using visual transformers.
Top-2 on ADE20K Semantic segmentations dataset. The UperNet (Swin-T/S/B/L) network is more accuate on ADE20K but is not real-time, while DPT is faster and real-time.
Paper: https://arxiv.org/abs/2103.13413
GitHub (Pytorch): https://github.com/intel-isl/DPT
Paperswithcode: https://paperswithcode.com/paper/vision-transformers-for-dense-prediction

pjreddie / darknet