agent的选择 - Githubissues

mengmeng6660 commented 4 months ago

对于这种零样本问题 Zero-Shot Object Goal Visual Navigation 使用navigation_agent 而您这使用semantic是否合理

rginjapan commented 4 months ago

@wanhaozhangxiaorui 我也注意到了这个问题，想问下一下这两种agent模式有什么区别？？

mengmeng6660 commented 4 months ago

@rginjapan 我看Zero-Shot Object Goal Visual Navigation和TDANet: Target-Directed Attention Network For Object-Goal Visual Navigation With Zero-Shot Ability这两篇论文都提到了零样本导航是训练时只用到可见类在获取当前图像的物体检测框时会将不可见类排除而这篇论文在训练时也会检测到不可见类只不过不会当做目标对象罢了这种改变应该不算零样本导航了吧毕竟训练时也见过了只不过没当做目标罢了我是这么理解的有可能理解错误还望指正

rginjapan commented 4 months ago

@wanhaozhangxiaorui 嗯嗯。跟我想的是一样的。所以我之前一直不理解他为什么能把结果从20多提升到60多，毕竟他的State matrix也没什么大的不同，GCN也是。还有TDANet的提升也很大，不过没有代码可以测。。。我现在在train this work on NavigationAgent, 看看效果和ZSON有多大提升。。。

rginjapan commented 4 months ago

@wanhaozhangxiaorui 你可以试试跑之江的ICRA 2024 ：https://github.com/nuoxu/AKGVP 里面也有zero-shot的evaluation，我也有很多困惑。。。

mengmeng6660 commented 4 months ago

我看过那篇论文和这种基于相似度的有点区别等我跑跑代码有收获我就跟你交流

rginjapan commented 4 months ago

@wanhaozhangxiaorui I evaluated the self-trained model under NavigationAgent class, the SR on unseen is only 0.09, with 0.606 on seen classes, which are inferior than ZSON.

mengmeng6660 commented 4 months ago

I don't know the details of your model, but from your performance metrics, it seems that navigational performance is strongly correlated with target features.

rginjapan commented 4 months ago

It is the same model in this repo, I just used the NavigationAgent class instead of the SemanticAgent for training and evaluation.

mengmeng6660 commented 4 months ago

@rginjapan I understand your point, but there seems to be a misunderstanding. For zero-shot tasks, the zero-shot objects should be excluded during training, not included. Therefore, during training, we use "navigation," and during testing, we switch to using "semantic

rginjapan commented 4 months ago

我用semanticagent跑了一遍（训练时navigationagent），unseen: 0.616, seen, 0.579. 那么Zero-Shot Object Goal Visual Navigation这个好像训练和测试都是用的navigationagent啊？

chuangwaipiaoluohuaxiang commented 3 months ago

我用semanticagent跑了一遍（训练时navigationagent），unseen: 0.616, seen, 0.579. 那么Zero-Shot Object Goal Visual Navigation这个好像训练和测试都是用的navigationagent啊？

你好，请问你可以跑通CIRN的代码吗，我遇到了一些问题。

SmartAndCleverRobot / ICRA-CIRN

agent的选择 #5