-
### Metadata
Authors: Yaroslav Ganin, Tejas Kulkarni, Igor Babuschkin, S. M. Ali Eslami, Oriol Vinyals
Organization: DeepMind
Release Date: arXiv 2018
Paper: https://arxiv.org/pdf/1804.01118.pdf
…
-
Hi all,
When I try to run a MuJoCo environment with the ACKTR algorithm, it doesn't work. Here is the full log:
```
Training acktr on mujoco:Hopper-v2 with arguments
{'network': 'mlp'}
Traceback …
```
-
[Dataset](https://schema.org/Dataset) is pretty vague; it can cover anything from .zip files of .wavs of social science interviews to application-specific on-disk file formats, etc. In theory we cou…
-
**Hi Gareth Jones, I want to test the doom example. I have installed vizdoom and vizdoomgym, and I also verified in the Anaconda terminal that vizdoomgym works. I got the output below:**
-----------…
-
### 🚀 Feature
Some reinforcement learning problems, like [safe reinforcement learning](https://github.com/PKU-Alignment/safety-gymnasium/tree/main), require the environment to return multiple reward-like valu…
-
Hi all, I was wondering if OmniGibson / Behavior-1K has any manipulation tasks yet, and if there are reinforcement learning or learning from demos baselines?
Moreover, I saw that there are efforts …
-
We need a convenient way to manually provide initial knowledge for a policy.
For example, say we have a hexagonal grid of which we are tasked to choose a sequence in which it is certainly never cor…
-
## Bug report
- AirSim Version/#commit: 1.3.1
- UE/Unity version: 4.25.4
- OS Version: Windows 10
### What's the issue you encountered?
The simulation crashes after any amount of…