vandal-vpr / vg-transformers

Official Repository of "Learning Sequential Descriptors for Sequence-based Visual Place Recognition "
MIT License
39 stars 5 forks source link

Query regarding models #6

Closed divyagupta25 closed 1 year ago

divyagupta25 commented 1 year ago

Hi, thanks for sharing your work! I have a few questions about the models:

  1. In late fusion and intermediate fusion, the ViT and CCT backbones that you used in your experiments are trained on which dataset?
  2. Will you share the pre-trained model for SeqVLAD with CCT384 backbone (trained on MSLS)?
ga1i13o commented 1 year ago

Hello, thank you for your interest.

  1. For all ViT and CCT experiments, as you can see in our code, we start from the pre-trained on ImageNet. Then, in the main Table 1 of our paper we train them on all the datasets: results in the column 'Melbourne' are trained on melbourne and tested on MSLS test; the second column is trained on all MSLS and test on MSLS test, last column is train and test on Robotcar with splits as described in the paper.
  2. Yes, thank you for reminding me. We are currently exploring hosting options, so for now I added in the README a link to the model that you requested, and hopefully more will come soon
divyagupta25 commented 1 year ago

Thanks a lot for resolving my queries!