Open alexm-gc opened 2 years ago
Hi @alexm-gc
Thank you for your questions.
For Question 0: as opposed to the architectures presented in Table 2, the Reformer is specific to Transformers. In addition, note that the Reformer computes Y_1 = X_1 + Attention(X_2) and then Y_2 = X_2 + FF(Y_1). It therefore uses two successive sub-layers (Attention and FF) in its forward rule, which is not the case for the architectures considered in Table 2.
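For concreteness, here is a minimal sketch of that two-sub-layer reversible rule, with plain linear maps standing in for `Attention` and `FF` (the linear stand-ins and variable names are illustrative assumptions, not the Reformer implementation). The inverse recovers the inputs exactly without storing activations:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 4
W_attn = rng.standard_normal((d, d))  # stand-in for Attention (assumption)
W_ff = rng.standard_normal((d, d))    # stand-in for FF (assumption)

def attention(x):
    return x @ W_attn

def ff(x):
    return x @ W_ff

def forward(x1, x2):
    # Reformer's reversible rule: two successive sub-layers per block
    y1 = x1 + attention(x2)
    y2 = x2 + ff(y1)
    return y1, y2

def inverse(y1, y2):
    # Undo the two updates in reverse order
    x2 = y2 - ff(y1)
    x1 = y1 - attention(x2)
    return x1, x2

x1, x2 = rng.standard_normal(d), rng.standard_normal(d)
y1, y2 = forward(x1, x2)
r1, r2 = inverse(y1, y2)
assert np.allclose(r1, x1) and np.allclose(r2, x2)
```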
For Question 1: we do not actually conduct experiments on Transformers in our paper, although one can define the momentum counterpart of any Transformer.
Thanks for your interesting work!
The Reformer uses RevNet in a clever way. They double the dimension of `x` so that, for `x1, x2 = split(x)`, both `x1` and `x2` have the same dimension as the original `x`. This gives their invertible architecture the "same parameters" as the initial architecture. Let's call this ReformerRevNet.

Question 0. In Table 2, RevNet differs from MomentumNet only in the row "same parameters". I don't see why ReformerRevNet and MomentumNet would be different in Table 2?
Question 1. Is there any reason this ReformerRevNet baseline was not included?
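The dimension-doubling trick described above can be sketched as follows (a sketch under stated assumptions; the names are illustrative, not from the Reformer code). Since each half has the original dimension `d`, each sub-layer's weight matrices stay `d x d`, which is what gives the "same parameters" as the non-reversible architecture:

```python
import numpy as np

d = 8                                        # original model dimension
x = np.random.default_rng(0).standard_normal(2 * d)  # doubled-width state

# split(x): both halves have the same dimension d as the original x
x1, x2 = np.split(x, 2)
assert x1.shape == (d,) and x2.shape == (d,)

# Each sub-layer acts on d-dimensional vectors, so its weights are d x d,
# matching the parameter count of the original (non-reversible) layer.
```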
Apologies for any misunderstanding.