Ram81 / goat-bench

60 stars 7 forks source link

Add croco visual encoder and ImageNav scripts #1

Closed gchhablani closed 8 months ago

gchhablani commented 1 year ago

This PR adds croco to visual encoders. The original CrocoNet was a combined model containing encoder-decoder and masking logic. Here, it has been separated for the purpose of goal embedding caching and the masking logic has been removed as we will not be pretraining.

Additionally, we need a new binocular encoder for goal+obs embedding, which has been added in this PR.

To-do: