Open varun-intel opened 3 years ago
I'm trying to understand how the metrics in the paper were generated from this code, particularly in the Cooperative Navigation environment. Could you please explain how you aggregate the metrics for each episode? Thanks for your help.
In the paper, the reported reward is the reward at the last step of each episode. The occupied-landmark metric is handled the same way: it is the value at the last step of the episode.
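For anyone reproducing this, here is a minimal sketch of that last-step convention. It is not the repository's actual evaluation code: the per-step record layout, the key names `reward` and `occupied_landmarks`, the helper names, and the averaging across episodes are all assumptions made for illustration.

```python
from statistics import mean


def last_step_metrics(episode):
    """Return (reward, occupied_landmarks) from the final step of one episode.

    `episode` is a list of per-step dicts; the keys are hypothetical names,
    not identifiers taken from the repository.
    """
    final_step = episode[-1]
    return final_step["reward"], final_step["occupied_landmarks"]


def summarize(episodes):
    """Average the last-step metrics over episodes.

    Averaging across episodes is an assumption; the reply above only says
    that each episode contributes its last-step value.
    """
    rewards, occupied = zip(*(last_step_metrics(ep) for ep in episodes))
    return {"reward": mean(rewards), "occupied_landmarks": mean(occupied)}


# Made-up numbers, only to show the shape of the data.
episodes = [
    [{"reward": -3.0, "occupied_landmarks": 0},
     {"reward": -1.0, "occupied_landmarks": 2}],
    [{"reward": -2.5, "occupied_landmarks": 1},
     {"reward": -0.5, "occupied_landmarks": 3}],
]
summary = summarize(episodes)
print(summary)
```

Only the final entry of each episode is read, so rewards from earlier steps do not affect the summary.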