-
I've been at this every way I can think of and am unable to find a solution.
I can run the manual and example clients on the carla-server with no problem.
When I try to run 'python run_CIL.py…
-
## Bug description
Hi all, while trying to save a list of trajectories locally on my filesystem, I discovered that the save method of the serialize module is not working as expected, at least as presented…
-
Hi,
I manually controlled the ant robot from this example (https://github.com/NVIDIA-Omniverse/IsaacGymEnvs/blob/main/isaacgymenvs/tasks/ant.py) and recorded the corresponding joint angle values.
I…
-
How can I use reinforcement learning algorithms to train my model? After that, how can I evaluate my model? Should I focus only on the "pour" task?
-
In order to perform imitation learning, we need our domain experts (Markowitz models) to label the data with state-action pairs.
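
As a rough illustration of expert labeling, here is a minimal sketch. It assumes (the issue does not say this) that the expert is a two-asset minimum-variance Markowitz portfolio with short selling allowed, that a state is a sliding window of past returns, and that an action is the pair of portfolio weights. The names `covariance`, `min_variance_expert`, and `label_dataset` are hypothetical and not taken from any library.

```python
from statistics import mean


def covariance(xs, ys):
    """Sample covariance of two equally long return series."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)


def min_variance_expert(returns_a, returns_b):
    """Hypothetical expert: closed-form two-asset minimum-variance weights.

    Assumption: short selling is allowed, so a weight may fall outside [0, 1].
    """
    var_a = covariance(returns_a, returns_a)
    var_b = covariance(returns_b, returns_b)
    cov_ab = covariance(returns_a, returns_b)
    denom = var_a + var_b - 2 * cov_ab
    if denom == 0:
        # Degenerate case (identical deviations): fall back to equal weights.
        return (0.5, 0.5)
    w_a = (var_b - cov_ab) / denom
    return (w_a, 1.0 - w_a)


def label_dataset(returns_a, returns_b, window):
    """Pair each window of returns (state) with the expert's weights (action)."""
    if window < 2:
        raise ValueError("window must contain at least two observations")
    pairs = []
    for t in range(window, len(returns_a) + 1):
        state = (tuple(returns_a[t - window:t]), tuple(returns_b[t - window:t]))
        action = min_variance_expert(*state)
        pairs.append((state, action))
    return pairs
```

The resulting list of `(state, action)` tuples is the kind of dataset a behavior-cloning learner could be fit on; a real setup would likely use more assets and a proper quadratic-programming solver in place of the closed form.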
-
Hello, after training with waypoint in the simulation environment, I find that the success rate with waypoint is 90%, while the success rate without waypoint is 98%, which decreases the succ…
-
| Team Name | Affiliation |
|---|---|
| KeroKero | Unity Technologies |
- Paper: [Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information…
-
| Team Name | Affiliation |
|---|---|
| SSV | McGill University; McGill University; McGill University |
- Paper: [Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Advers…
-
Hi, thanks for this innovative work!
May I ask if you have conducted more ablation experiments on the data (e.g., just training for more steps in the imitation learning phase)? I think it's necessary to justif…
-
- [ ] [system-2-research/README.md at main · open-thought/system-2-research](https://github.com/open-thought/system-2-research/blob/main/README.md?plain=1)
# OpenThought - System 2 Research Links
He…