Clarification on Topological Graph based representation of Image map.

robodhruv / visualnav-transformer

Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.

MIT License

425 stars 56 forks source link

Hi Nigam, Thanks for your interest in our works!

The public code does not actually support the full topological graph and is implemented to get a much simpler version running where there is a single "path" in your graph that you want to follow. This is primarily to keep the release code clean and have a simple starting point for others to use.
Yes, it should be quite easy to get single-trajectory navigation out of the box with the repo (to go from one room to another).
Whether or not the model works well on Gazebo heavily depends on the environment you test it in. Unfortunately, the models were not trained on any sim data, so unless the sim is photorealistic, it may struggle. I would recommend trying out zero-shot with the current weights first and if it doesn't work, then I am happy to share pointers to do a small amount of fine-tuning on sim data to improve performance.

Dhruv

robodhruv / visualnav-transformer