imankgoyal / NonDeepNetworks

Official Code for "Non-deep Networks"
BSD 3-Clause "New" or "Revised" License
586 stars 42 forks source link

Non-deep Networks
NeurIPS 2022
Ankit Goyal, Alexey Bochkovskiy, Jia Deng, Vladlen Koltun

Overview: Depth is the hallmark of DNNs. But more depth means more sequential computation and higher latency. This begs the question -- is it possible to build high-performing ``non-deep" neural networks? We show that it is. We show, for the first time, that a network with a depth of just 12 can achieve top-1 accuracy over 80% on ImageNet, 96% on CIFAR10, and 81% on CIFAR100. We also show that a network with a low-depth (12) backbone can achieve an AP of 48% on MS-COCO.

If you find our work useful, please consider citing it: ``` @article{goyal2021nondeep, title={Non-deep Networks}, author={Goyal, Ankit and Bochkovskiy, Alexey and Deng, Jia and Koltun, Vladlen}, journal={arXiv:2110.07641}, year={2021} } ```