robodhruv / visualnav-transformer

Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
http://general-navigation-models.github.io
MIT License
425 stars, 56 forks

Clarification on Topological Graph based representation of Image map. #5

Closed by NigamKatta 5 months ago

NigamKatta commented 8 months ago

Hey, I am confused about how the topological graph works. As far as I can see, it takes the camera feed and converts it into a series of images that are named sequentially. Doesn't the map need to be a tree structure? I remember the paper showing a tree-shaped graph. Also, with the current implementation, won't it be difficult to navigate from one room to another? Is it possible to test it in Gazebo? How accurately does it work zero-shot? Do I need to train it for a long time, or is it OK to use the current weights?
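
To make the question concrete: a set of sequentially named images can be read as a path graph, the simplest kind of topological graph, in which each image is a node connected only to its successor. The snippet below is a minimal sketch under that assumption. The function `build_chain_topomap` and the file names are hypothetical and are not taken from the repository.

```python
from pathlib import Path


def build_chain_topomap(image_names):
    """Order image names by numeric stem and link consecutive nodes.

    Hypothetical helper for illustration only; not the repository's code.
    """
    # Sort numerically so that "10.png" comes after "2.png".
    nodes = sorted(image_names, key=lambda name: int(Path(name).stem))
    # A sequential map is a path graph: node i connects only to node i + 1.
    edges = [(nodes[i], nodes[i + 1]) for i in range(len(nodes) - 1)]
    return nodes, edges


# Assumed example file names, deliberately out of order.
nodes, edges = build_chain_topomap(["2.png", "0.png", "10.png", "1.png"])
print(nodes)
print(edges)
```

Under this reading, a tree or general graph would differ only in allowing a node to have more than one outgoing edge, for example at a doorway where two rooms branch off.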

robodhruv commented 5 months ago

Hi Nigam, thanks for your interest in our work!

Dhruv