Closed baochi0212 closed 2 months ago
This one is a tricky one, it does improve the overall speed because a much bigger BS can be used when this feature is on.
may be we should put an Yes*
next to it and write a note under the table:
It slows things down for the given batch size, but since it frees up a lot of memory, enabling a much larger BS, it actually improves the overall speed.
What do you think?
Yeah, i agree. I thought that it must be "No" for fixed batch size. Ok, I changed the pr.
thinking more about it, I think a No*
is probably more correct as you suggested.
Thank you for this contribution, @baochi0212!
grad checkpoint tiny error about speed