avishkarsaha / translating-images-into-maps

Official PyTorch code for 'Translating Images Into Maps' ICRA 2022 (Outstanding Paper Award)

Is this code really based on what the paper describes? #36

Open basbaba opened 1 year ago

basbaba commented 1 year ago

We are studying TIIM and have run into several problems. The two most confusing points are:

  1. The code uses model/transformer/Transformer rather than model/transformer/TransformerMonotonic, even though monotonic attention (MoCha) should be the main idea according to the paper.
  2. In TransformerMonotonic, the image features are arranged as HxNWxC, which suggests the features are scanned row by row rather than column by column. The paper, however, emphasizes columns and explains why vertical features work better for the translation.
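
To make the second point concrete, here is a minimal sketch of how an HxNWxC layout can be read. It is not the repository's code: the sizes, the `feats` values, and the `to_seq_first` helper are hypothetical, and it assumes the tensor follows the sequence-first (S, N, E) convention that `torch.nn.Transformer` uses by default. If that assumption holds, the first axis (H) is the sequence and each of the N*W entries on the second axis is one image column, which would match the paper. If the code instead treats the second axis as the sequence, the scan would be row by row, as described above.

```python
# Hypothetical sizes, chosen only for illustration.
N, C, H, W = 2, 1, 3, 4

# Feature map as nested lists indexed [n][c][h][w].
# Each value records where it came from, so the ordering is easy to inspect.
feats = [
    [[[(n, h, w) for w in range(W)] for h in range(H)] for c in range(C)]
    for n in range(N)
]


def to_seq_first(feats):
    """Rearrange N x C x H x W into H x (N*W) x C (assumed sequence-first layout)."""
    n_imgs = len(feats)
    n_ch = len(feats[0])
    height = len(feats[0][0])
    width = len(feats[0][0][0])
    return [
        [
            [feats[n][c][h][w] for c in range(n_ch)]
            for n in range(n_imgs)
            for w in range(width)
        ]
        for h in range(height)
    ]


seq = to_seq_first(feats)

# Under the assumed (S, N, E) convention, fixing the second index and walking
# the first axis yields one sequence. Pick image n=1, column w=2.
b = 1 * W + 2
column_sequence = [seq[h][b][0] for h in range(H)]
print(column_sequence)  # [(1, 0, 2), (1, 1, 2), (1, 2, 2)]
```

In this sketch the sequence walks down a single column (fixed n and w, varying h), so checking which axis the attention in TransformerMonotonic actually iterates over would settle the question.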

Did I misunderstand the paper or the code? Please correct me if I'm wrong.

Thanks!