HYeCao closed this issue 3 months ago
Hi, here is some of our experiment data on pick-place-wall. On some seeds, the agent may fail. (DrmCAC = ACE and DrmCBAC = ACE+BEE.)
Please check your MetaWorld and Gym versions, as well as the reward and state types. We will check the MetaWorld wrapper again.
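A quick way to collect the installed versions for a comparison like this is sketched below. It is only an illustration: the distribution names in `PACKAGES` (`gym`, `metaworld`, `mujoco-py`, `torch`) are assumptions about what a typical setup uses, and the helper names are hypothetical, not part of this repository.

```python
from importlib.metadata import PackageNotFoundError, version

# Assumed distribution names; adjust to match your own environment.
PACKAGES = ("gym", "metaworld", "mujoco-py", "torch")


def installed_version(dist_name):
    """Return the installed version string, or None if the package is missing."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


def version_report(dist_names=PACKAGES):
    """Map each distribution name to its installed version (or None)."""
    return {name: installed_version(name) for name in dist_names}


if __name__ == "__main__":
    for name, ver in version_report().items():
        print(f"{name}: {ver or 'not installed'}")
```

Posting this output along with the reward and state type used makes it easier to spot a version mismatch.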
Here is a snapshot of our WandB dashboard for the walker2d-v2 environment, without any smoothing. Some suggested HPs that may improve performance on walker2d-v2: "batch_size: 256, updates_per_step: 1, target_update_interval: 1, hidden_size: 256"
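For reference, the suggested values above can be kept as a small override dictionary and merged into whatever defaults a run already uses. This is a minimal sketch: only the four keys and values come from the comment above, while `apply_overrides`, `to_cli_args`, and the `--key value` flag format are hypothetical and may not match the training script's actual interface.

```python
# Values taken from the suggestion above for walker2d-v2.
WALKER2D_SUGGESTED = {
    "batch_size": 256,
    "updates_per_step": 1,
    "target_update_interval": 1,
    "hidden_size": 256,
}


def apply_overrides(base, overrides):
    """Return a new config where `overrides` replace matching keys in `base`."""
    merged = dict(base)
    merged.update(overrides)
    return merged


def to_cli_args(config):
    """Render a config as a flat list of `--key value` strings (assumed flag style)."""
    args = []
    for key, value in config.items():
        args += [f"--{key}", str(value)]
    return args


if __name__ == "__main__":
    # Hypothetical defaults, only here to show the merge.
    defaults = {"batch_size": 128, "lr": 3e-4}
    print(to_cli_args(apply_overrides(defaults, WALKER2D_SUGGESTED)))
```

Logging the merged config at the start of each run makes it easier to compare settings across seeds when results differ.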
Thank you very much for your reply. I will try again with your HPs. Could you provide the complete HP settings for the different environments and tasks?
update experiment curves
When reproducing experiments in environments such as walker2d and pick-place-wall, we observed significant discrepancies between our results and those reported in the papers. What could be causing this?