Closed BingHan0458 closed 3 years ago
And which models do network_01653000, network_01900000 and network_02360000 correspond to, respectively? How can I watch their structures? Thanks!
Yes, you're correct that is the action for the host_agent. If there are multiple agents in the environment, they would each compute their next action separately (presumably before any of the agents execute the action at that timestep).
To re-create Figs 4 and 6 from the IROS paper, it may be easier to use our gym environment repo. In particular, there's a small test suite script that I think runs those test cases. This might also be a useful repo if you're looking to compare algorithms across random scenarios.
Whichever network file is loaded by default is the GA3C-CADRL-10 that should perform the best. The other ones are either trained for fewer steps or might have had different settings, so it's probably best to ignore the other network files.
hello, author, I try to run 'ga3c_cadrl_demo.py' and it prints 'action: [1.2 0.52359878]'. I understand that it means the next action of the host_agent and is that correct? And how to get the figures (Fig. 4 and Fig. 6) in you paper if I run the code 'network.py'?