-
I was looking for the implementations of your baselines, since the paper says that `lack of baselines make algorithmic comparisons difficult`, but could not find anything either in this repo or in the…
-
The TF version in my Linux env is 2.2. I installed the dependencies based on your requirement.txt, but when I run your Atari demo, it tells me 'no module named tf.contrib', so I downgraded the TF version t…
-
Hi, I am interested in testing my algorithm on the `dm_control` dataset as a benchmark, and I am stuck on the data-loading part.
Since `dm_control` is a set of sequential control tasks and I have seen in …
-
Hi,
Thanks for your nice work. I want to train an RNN agent with sequential data and I tested it with dm_control_suite. The paper mentioned that: "For sequence data, we also provide future states,…
-
Hey,
I'm wondering if there is any intention to extend the codebase for MuZero Unplugged to make it work in an offline RL setting?
-
- [x] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [ ] RL algorithm bug
+ [ ] documentation request (i.e. "X is missing from the documentation.")
+ [x] ne…
-
- [x] I have marked all applicable categories:
+ [x] exception-raising bug
+ [x] RL algorithm bug
+ [x] documentation request (i.e. "X is missing from the documentation.")
+ [x] ne…
-
Hi @takuseno thanks for this amazing library!!
I want to train on Mujoco-Gym continuous control tasks with image observations. I have a few questions regarding this:
1. It seems like the dataset in D4R…
-
```
A fatal error has been detected by the Java Runtime Environment:
#
# EXCEPTION_ACCESS_VIOLATION (0xc0000005) at pc=0x03d9c70d, pid=15208, tid=9740
#
# JRE version: OpenJDK Runtime Environmen…