-
### 📚 Documentation
I am trying to understand how the RecurrentActorCriticPolicy works. Coming from an NLP background, I am used to having tensors of shape (batch_size, seq_len, feature_dim) as inpu…
-
While running TRPO training, after a random amount of time (anywhere from 15 s to 1 min) it fails with the following:
`Traceback (most recent call last):
File "callback.py", line 196, in
model.lea…
-
[Self Imitation Learning](https://arxiv.org/abs/1806.05635)
@emrul has implemented SAIL, see https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/pull/139#issuecomment-1445114579
@em…
-
Currently I have a huge dilemma:
- backport all my code to TF 1, in order to use Stable Baselines and my code in one project
- or use something less mature than Stable Baselines (e.g. TF Agents) only …
-
Dear altruists, I am new to **Stable Baselines and RL**. I am trying to continue training my previously trained PPO1 model so that it resumes learning from where the previous training left off. What I …
-
Dear Author,
I hope this message finds you well. First, I would like to thank you for sharing your project on GitHub. Your work is incredibly valuable, and I appreciate the effort you have put in…
-
When I try to run the code below I get this error at the pretrain function:
**Error**
```
File "C:\Users\fabio\Desktop\wetransfer-08d028\Rope_ex_v1.5\RL_Training\behaviour_cloning.py", line 40, i…
-
**Describe the bug**
pwnagotchi works fine in auto mode. When switching from auto to AI mode, it logs an error and no longer discovers APs.
Logs:
[2020-09-28 14:31:02,190] [ERROR] [ai] error whi…
-
It seems to me that when HER samples an achieved goal from the replay buffer it never samples the very last state of the episode. Is this intended?
As a consequence, the sampling strategy "final" …
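
To make the question concrete, here is a minimal sketch of the off-by-one being described. It is not the library's implementation: the function names, the `include_last` flag, and the indexing convention (an episode with `episode_len` transitions visits states `0..episode_len`) are all assumptions made for illustration.

```python
import random


def candidate_goal_indices(t, episode_len, strategy, include_last):
    """Return the state indices eligible as relabeled goals for transition t.

    Assumed convention (hypothetical, for illustration only): an episode with
    `episode_len` transitions visits states 0..episode_len, and transition t
    moves from state t to state t + 1.
    """
    # Last eligible state: the true final state, or one short of it.
    end = episode_len if include_last else episode_len - 1
    if strategy == "final":
        return [end]
    if strategy == "future":
        # Any state reached after transition t, clamped so the list is never empty.
        start = min(t + 1, end)
        return list(range(start, end + 1))
    raise ValueError(f"unknown strategy: {strategy}")


def sample_goal_index(t, episode_len, strategy, include_last, rng):
    """Sample one eligible state index uniformly at random."""
    return rng.choice(candidate_goal_indices(t, episode_len, strategy, include_last))


if __name__ == "__main__":
    rng = random.Random(0)
    episode_len = 5
    for include_last in (True, False):
        picks = [
            sample_goal_index(rng.randrange(episode_len), episode_len,
                              "future", include_last, rng)
            for _ in range(1000)
        ]
        # Report whether the final state (index episode_len) was ever chosen.
        print(include_last, episode_len in picks)
```

Under these assumptions, excluding the last index means the "final" strategy returns the second-to-last state rather than the true final one, which is the consequence the question points at.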