Open ipsec opened 2 months ago
Hey, this is on the roadmap; however, I don't have any immediate plans to implement it. If you'd like to give it a shot, I'd be more than happy to review it and assist with development. Otherwise, it might be a while until this is implemented.
Let me try, then. I had a little difficulty with the loss function. If you could help me with that part, it would be great.
@EdanToledo PR #78 created. As I said, I had difficulty with the loss function, so a careful review is necessary.
Hey, I haven't forgotten about this. Sorry, it's an important PR and I will hopefully get to it ASAP.
Add Stochastic MuZero implementation - paper and pseudocode
With this improved version of MuZero, Stoix would be able to train on stochastic environments such as the 2048 game and poker (Leduc poker).