andytu28 / VQT

BSD 3-Clause "New" or "Revised" License
20 stars 0 forks source link

GPU memory usage #1

Open huofushuo opened 1 year ago

huofushuo commented 1 year ago

Hello, excellent work! I run the VQT, why the GPU memory does not reduce compared to VPT? Thanks and hope for the anwser!

bityangke commented 1 year ago

I am also confused. What are the experiment settings for Figure 3 in this paper?

XXD-N commented 10 months ago

I think it's because although the computation graph is simplified, it introduces more learnable parameters at the end of the task head. And I ran some experiments and found that this method has a slightly worse performance than methods such as adapter, with a performance gap of about 6% from the best performance in my experiments.