-
**Submitting author:** @kulbachcedric (Cedric Kulbach)
**Repository:** https://github.com/online-ml/deep-river
**Branch with paper.md** (empty if default branch): paper
**Version:** v0.2.6
**Editor:**…
-
First of all - thank you very much for this repository! You have made diving into Reinforcement Learning easier!
About the issue: I think you should use huber_loss instead of square_difference. Loo…
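To illustrate the difference, here is a minimal sketch. It assumes that `huber_loss` and `square_difference` refer to the standard Huber loss (with a threshold `delta`) and the plain squared error. The function names below are illustrative only and are not taken from the repository.

```python
def squared_error(pred, target):
    """Plain squared difference; grows quadratically with the error."""
    return (pred - target) ** 2


def huber(pred, target, delta=1.0):
    """Huber loss: quadratic for |error| <= delta, linear beyond it.

    `delta` is an assumed default, not a value from the repository.
    """
    err = abs(pred - target)
    if err <= delta:
        return 0.5 * err ** 2
    return delta * (err - 0.5 * delta)


# A large error dominates the squared loss but is damped by the Huber loss.
for e in (0.5, 1.0, 10.0):
    print(e, squared_error(e, 0.0), huber(e, 0.0))
```

Because the Huber loss grows only linearly for large errors, a single outlier target contributes a bounded gradient, which is the usual argument for preferring it in value-based training.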
-
I wonder what can cause such fluctuations of IoU values. The mAP is strangely stable as well.
My training set is very small (~450 for training, ~120 for test).
Config file:
`
batch=64
subdivis…
-
### Description
The text of the caption just before section 6 is misaligned.
### (Optional:) Please add any files, screenshots, or other information here.
_No response_
### (Required) What is this…
-
### ❓ Question
I want to modify the network structure for RecurrentPPO, but when I run the original network, I get the following error:
self.features_extractor = features_extractor_class(se…
-
I am trying to train a model using PPO, and the `stable-baselines3[extra]` library is also installed.
The issue occurs because the StochasticFrameSkip object does not have an action_space attribute, l…
-
Are you thinking of cool ways AI can enhance the authorization game? This is one of our 10 community feature challenges, and we want your input on how we can enhance the Permit CLI in the area of Aut…
-
I really like this project. Do you have a roadmap?
-
- [ ] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [ ] RL algorithm bug
+ [ ] documentation request (i.e. "X is missing from the documentation.")
+ [x] ne…
-
What I mean by monitor here is to use gym.wrappers.