Open NrealWJX opened 6 months ago
Thank you for your excellent work!
I was wondering about the performance at patch size p = 1, which is not reported in your paper. Could you explain why this setting was not tested? Was it due to memory constraints on a single TPU-v3 with a batch size of 1?
Looking forward to your reply! :)