Closed: numb3r3 closed this issue 1 year ago
We found in the paper that ToMe works better for larger images than for smaller ones (lower accuracy drop, more speed-up). Of course, since a larger image produces more tokens, you have to increase the number of tokens reduced per layer accordingly.
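To make the "more tokens, so reduce more per layer" point concrete, here is a rough sketch of the arithmetic. It is not taken from the ToMe code base. The helper names (`num_patch_tokens`, `scaled_r`, `tokens_after`) are hypothetical, and the defaults (patch size 16, 12 layers, one class token) describe a ViT-B/16-style model that I am assuming for illustration. Scaling the per-layer reduction in proportion to the token count is only one plausible heuristic, not a rule from the paper.

```python
def num_patch_tokens(image_size: int, patch_size: int = 16) -> int:
    """Number of patch tokens for a square image (class token excluded)."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    return (image_size // patch_size) ** 2


def scaled_r(base_r: int, base_size: int, new_size: int, patch_size: int = 16) -> int:
    """Scale the per-layer reduction in proportion to the token count.

    Assumption: proportional scaling is a heuristic chosen for this sketch.
    """
    ratio = num_patch_tokens(new_size, patch_size) / num_patch_tokens(base_size, patch_size)
    return round(base_r * ratio)


def tokens_after(image_size: int, r: int, num_layers: int = 12, patch_size: int = 16) -> int:
    """Tokens left after removing r tokens in each of num_layers layers.

    Simplification: at most half of the non-class tokens are removed per layer.
    """
    tokens = num_patch_tokens(image_size, patch_size) + 1  # +1 for the class token
    for _ in range(num_layers):
        tokens -= min(r, (tokens - 1) // 2)
    return tokens


if __name__ == "__main__":
    base_r = 8  # hypothetical setting at 224x224
    for size in (224, 384):
        r = scaled_r(base_r, 224, size)
        print(size, num_patch_tokens(size), r, tokens_after(size, r))
```

Under these assumptions, a 384x384 input has about three times as many patch tokens as a 224x224 input (576 vs. 196), so keeping the same per-layer reduction would remove a much smaller fraction of the tokens and give less speed-up.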
Just curious: is the patch merging approach sensitive to the resolution of the input image?