Closed (tianyma closed this 3 years ago)
Yeah, this looks like flattened and unflattened discrete actions are getting mixed up. What policies are you using inside ContextConditionedPolicy? The inner policy probably needs to output a one-hot vector instead of a discrete action index (or we need to put a fix somewhere in the core datatypes).
Thank you for your reply. I found that I have to unflatten the action first, and then I can get the discrete action number.
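For anyone hitting the same mismatch, here is a minimal sketch of the flatten/unflatten round trip for a discrete action, written with plain NumPy rather than garage's own space classes. The helper names `flatten_discrete` and `unflatten_discrete` and the action count of 4 are assumptions for illustration only, not part of garage's API.

```python
import numpy as np


def flatten_discrete(action, n):
    """Encode an integer action in [0, n) as a one-hot vector (hypothetical helper)."""
    one_hot = np.zeros(n, dtype=np.float32)
    one_hot[action] = 1.0
    return one_hot


def unflatten_discrete(one_hot):
    """Recover the integer action from a one-hot vector (hypothetical helper)."""
    return int(np.argmax(one_hot))


n_actions = 4  # assumed size of the discrete action space
flat = flatten_discrete(2, n_actions)  # what a "flattened" policy output looks like
action = unflatten_discrete(flat)      # the integer a discrete env's step() expects
print(flat, action)
```

If the policy returns the flattened (one-hot) form, it has to be converted back like this before being passed to an environment that expects an integer action.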
Hi, I am running PEARL on my custom environment, but an error occurs. Can you help me? I am currently using the master branch.