Closed adreena closed 1 year ago
Research: [Alex, Jenish, Christian, Kimia]
try simpler action spaces:
run DQN with discrete action-space [helps debugging the reward/observation adapter]
run continuous baselines (ppo/sac) [ more baselines/ add new baselines]
reward trimming:
observation trimming:
Engineering: [Kimia, Christian, Jenish] urgent:
run DAI/zoo as ego-agent in ultra [debug/record-plots]
later:
Proposal for separate issues:
Research: [Alex, Jenish, Christian, Kimia]
try simpler action spaces:
run DQN with discrete action-space [helps debugging the reward/observation adapter]
run continuous baselines (ppo/sac) [ more baselines/ add new baselines]
reward trimming:
observation trimming:
Engineering: [Kimia, Christian, Jenish] urgent:
run DAI/zoo as ego-agent in ultra [debug/record-plots]
later: