Closed a1wj1 closed 1 year ago
I haven't understood the code for generating the action yet. Could you please explain it? Thank you
Hi, the code just calculates the mode of the Beta distribution specified by the alpha and beta values. And it then converts the value between [0,1] to [-1,1].
I haven't understood the code for generating the action yet. Could you please explain it? Thank you