Open jerrywn121 opened 1 year ago
Aren't patch merging and patch embedding doing the same thing? Why is patch merging implemented differently when we could simply use a convolution with kernel size 2 and stride 2 to produce the output?
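To make the comparison concrete, here is a minimal NumPy sketch. It assumes "patch merging" means the Swin-style operation (concatenate each 2x2 neighbourhood into 4C channels, then apply a bias-free linear layer) and it deliberately leaves out the normalization that such implementations usually apply to the 4C concatenated channels before the linear layer. The function names and the `(2, 2, C, C_out)` kernel layout are hypothetical choices for illustration, not taken from any repository.

```python
import numpy as np

# Order in which the four 2x2 neighbours are concatenated (assumed here):
# (row offset, column offset) for x0, x1, x2, x3.
OFFSETS = [(0, 0), (1, 0), (0, 1), (1, 1)]


def patch_merge_linear(x, weight):
    """Concatenate each 2x2 neighbourhood, then apply a bias-free linear map.

    x:      (H, W, C) feature map with even H and W
    weight: (4C, C_out) linear weight
    Normalization is intentionally omitted in this sketch.
    """
    parts = [x[dh::2, dw::2, :] for dh, dw in OFFSETS]
    merged = np.concatenate(parts, axis=-1)  # (H/2, W/2, 4C)
    return merged @ weight                   # (H/2, W/2, C_out)


def linear_weight_to_kernel(weight, channels):
    """Rearrange a (4C, C_out) linear weight into a (2, 2, C, C_out) kernel."""
    kernel = np.zeros((2, 2, channels, weight.shape[1]), dtype=weight.dtype)
    for k, (dh, dw) in enumerate(OFFSETS):
        kernel[dh, dw] = weight[k * channels:(k + 1) * channels]
    return kernel


def conv2x2_stride2(x, kernel):
    """Bias-free convolution with kernel size 2 and stride 2 (no padding)."""
    h, w, c = x.shape
    # Split the map into non-overlapping 2x2 blocks: axes (i, dh, j, dw, c).
    blocks = x.reshape(h // 2, 2, w // 2, 2, c)
    return np.einsum("idjec,deco->ijo", blocks, kernel)


rng = np.random.default_rng(0)
x = rng.standard_normal((8, 8, 3))
weight = rng.standard_normal((4 * 3, 2 * 3))
kernel = linear_weight_to_kernel(weight, channels=3)

same = np.allclose(patch_merge_linear(x, weight), conv2x2_stride2(x, kernel))
print(same)
```

Under these assumptions the two paths compute the same linear map, so the concat-plus-linear form is a strided 2x2 convolution with rearranged weights. If a normalization over the 4C concatenated channels sits between the concatenation and the linear layer, that step is not part of a plain convolution, which would be one place where the two formulations differ.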