Open search4mahesh opened 6 years ago
Hi,
Thanks for this code repo. I have one question , which environment you are using for RL ?
Thanks Mahesh
Sorry, I don't get the point about the meaning of 'environment'. If your intent is to ask for the RL algorithm, I used policy gradient in this repo.
Hi,
Thanks for this code repo. I have one question , which environment you are using for RL ?
Thanks Mahesh