It works well on mujoco environments, but not on antmze environment .It did not work even if I changed the parameters according to the paper(expectile=0.9, temperature=10). Can you help me please?
I'm not sure what the problem is but I will have a closer look once I have time.
Could be that its caused due to the cosine schedule for the actor learning rate which I didn't add yet.
It works well on mujoco environments, but not on antmze environment .It did not work even if I changed the parameters according to the paper(expectile=0.9, temperature=10). Can you help me please?