Daner-Wang opened 2 years ago
We eliminated some errors in the code and tried to reproduce the authors' experiment with the parameters they provided; however, the experimental results were far from those in the paper. In the paper, the proportion of remaining FLOPs for DeiT-Tiny is 49.23%, while in our reproduction the proportion is more than 70%, which is confusing. In addition, we also found that the FLOPs calculation in the provided code might be wrong.
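For reference, this is roughly what "proportion of remaining FLOPs" means in pruning papers: the FLOPs of the pruned model divided by the FLOPs of the dense model. A minimal sketch (the function name, per-layer breakdown, and `kept_fraction` representation are our own illustration, not the repo's actual API):

```python
def remaining_flops_ratio(layer_flops, kept_fraction):
    """Fraction of dense-model FLOPs that survive pruning.

    layer_flops   -- FLOPs of each layer in the *dense* model
    kept_fraction -- per-layer fraction of FLOPs kept after pruning (0..1)
    """
    assert len(layer_flops) == len(kept_fraction)
    dense = sum(layer_flops)
    pruned = sum(f * k for f, k in zip(layer_flops, kept_fraction))
    return pruned / dense


# Toy example: two equal-cost layers, one pruned to 40%, one untouched.
ratio = remaining_flops_ratio([100.0, 100.0], [0.4, 1.0])
print(f"{ratio:.2%}")  # → 70.00%
```

A subtle point worth checking when auditing the repo's FLOPs counter: for attention layers, pruning tokens reduces FLOPs roughly quadratically in the attention matrix but only linearly in the projections, so a simple per-layer linear scaling like the sketch above can over- or under-count.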
Sorry for the late reply. We promise we will finish cleaning the code by the end of next week and upload the fully fixed code.
Thank you for pointing out the issues in our codebase. Our responses to your problems are as follows:
Update Jul. 7th: new logs for DeiT-Tiny and DeiT-Base. For DeiT-Tiny, the paper reports 49.23%; the reproduced result in the screenshot is 48.45% at epoch 9, which is not exactly the same but very close.
For DeiT-Base, the paper reports 45.5%; the reproduction reaches a 42% model at epoch 17 and a 50% model at epoch 19. Although these two results are not exactly the same numbers as in the paper (due to randomness), they show that there is no fundamental problem preventing compression to 50% FLOPs. For DeiT-Base, using a different z-learning-rate schedule, "5,10,15,20,25", can reproduce the result.
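Since the thread does not spell out how "5,10,15,20,25" is interpreted, here is one plausible reading to check against the repo: epoch milestones at which the z (mask/gate) learning rate is decayed by a constant factor, analogous to a multi-step LR schedule. Everything in this sketch (the function name, the decay factor `gamma`, the milestone semantics) is an assumption for illustration:

```python
def z_lr_at_epoch(epoch, milestones, base_lr, gamma=0.1):
    """Multi-step schedule: decay base_lr by gamma at each passed milestone.

    ASSUMPTION: "5,10,15,20,25" is a list of epoch milestones; the actual
    semantics in the authors' code may differ.
    """
    passed = sum(1 for m in milestones if epoch >= m)
    return base_lr * (gamma ** passed)


milestones = [5, 10, 15, 20, 25]
for epoch in (0, 5, 17):
    print(epoch, z_lr_at_epoch(epoch, milestones, base_lr=0.1))
```

If the repo uses PyTorch, the equivalent built-in would be `torch.optim.lr_scheduler.MultiStepLR` with `milestones=[5, 10, 15, 20, 25]` applied to the parameter group holding the z variables.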
The logs mentioned above are under log/.
All dependencies are installed, so there is no dependency problem, but the code still throws all sorts of errors at runtime; some of them are plainly visible in the editor. Please upload code that actually runs correctly, along with the log files for the best results shown by the authors.