Question about Resnet-18 to encode vision info in Alfred benchmark

askforalfred / alfred

ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks

MIT License

352 stars 77 forks source link

Closed xyzhu123 closed 1 year ago

xyzhu123 commented 1 year ago

Hello, may I ask what is the training set used for the resnet-18 used to encode visual scenes in the Alfred dataset?

MohitShridhar commented 1 year ago

@WhyTheMoon, it's just a standard ResNet-18 from PyTorch pre-trained on ImageNet. See this script for details.