kyegomez / ViTAR

Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch
MIT License
22 stars 1 forks source link

Where is the Fuzzy Positional Encoding section ? #5

Open FurkanKt opened 2 weeks ago

FurkanKt commented 2 weeks ago

Hello,

First of all, thank you for the code you developed. I see that some sections in the article are missing. For example, I could not find the Fuzzy Positional Encoding section in the code.

Are you considering adding a FPE block?

Upvote & Fund

Fund with Polar

github-actions[bot] commented 2 weeks ago

Hello there, thank you for opening an Issue ! 🙏🏻 The team was notified and they will get back to you asap.

kyegomez commented 2 weeks ago

@FurkanKt ah yes, I might have missed it. Can you submit a pr for it?