Closed thuanaislab closed 3 months ago
Hi, thank you for the great work. Can you answer the question above? Thank you very much.
Hi, DINOv2 is used during inference. DINOv2 coarse features are used to guide/prune the attention map constructed between the two images' features.
Hi, thank you for the great work. Can you answer the question above? Thank you very much.