-
hello,
i get this error, when i run code "python main.py --env-name "PongNoFrameskip-v4"
i don't know what happed, my env is:
python3.6.3
Package Version
----------------- -------
a…
-
Hi Lucas,
I've been working on my 3D indoor environment. It's still very basic, but it works, and I just made the repository public: https://github.com/maximecb/gym-miniworld
I've tried to adjus…
-
After value, logit, (hx, cx) = model((Variable(state.unsqueeze(0)),(hx, cx))) in train.py, the program doesn't go on. Do you have any idea?
-
Hello, thank you very much for your wonderful work @YX-S-Z !
I have a question about the COT splitting in RL. In particulaire, in the fragment where you split my the sequence of tokens, shouldn't t…
-
I can't get started train_minteral_shards.py example. Getting this error:
$ python train_mineral_shards.py
Traceback (most recent call last):
File "train_mineral_shards.py", line 14, in
…
-
**Are you requesting a feature or an implementation?**
To handle the partial MDP task, the recurrent policy is currently quite popular. We need to add a lstm layer after the original conv (or mlp) …
-
I have some questions about pysc2 baseline agent of Deepmind.
https://deepmind.com/blog/deepmind-and-blizzard-open-starcraft-ii-ai-research-environment/
I'm trying to implement baseline agent il…
-
Traceback (most recent call last):
File "main.py", line 258, in
main(args)
File "main.py", line 42, in main
train_model()
File "main.py", line 209, in train_model
value_loss, …
-
Since choosing the default acktr algorithm results in “RuntimeError: symeig_cuda: the algorithm failed to converge”, I chose to run the a2c algorithm, but it still stops with an error. The command to …
-
Hello Stefan! We are happy to find your interesting work, and we are conducting further experiments based on your project, I met some problems during the implementation and I carefully read your paper…