-
Hi, I'm new to wandb and am running the code in [ppo-implementation-details](https://github.com/vwxyzjn/ppo-implementation-details) following the video tutorial [Part 1 of 3 — Proximal Policy Optimization…
-
This bug causes exploding gradients when an action mask is added in the `dist_fn`.
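
For context, here is a minimal sketch of one common way to apply an action mask before building a categorical distribution. It does not reproduce the reported bug, and the report does not say how the mask is applied. The helper name `masked_softmax` and the fill value `-1e9` are assumptions for illustration only. The sketch replaces the logits of invalid actions with a large negative finite number rather than `-inf`, and rejects a mask with no valid action.

```python
import math


def masked_softmax(logits, mask, fill=-1e9):
    """Softmax over `logits`, suppressing entries whose mask value is False.

    `fill` is a hypothetical large negative finite value used in place of -inf.
    """
    if len(logits) != len(mask):
        raise ValueError("logits and mask must have the same length")
    if not any(mask):
        raise ValueError("at least one action must be valid")
    # Replace the logits of invalid actions with the fill value.
    masked = [x if m else fill for x, m in zip(logits, mask)]
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(masked)
    exps = [math.exp(x - peak) for x in masked]
    total = sum(exps)
    return [e / total for e in exps]


probs = masked_softmax([2.0, 1.0, 0.5], [True, False, True])
```

In this usage the second action is masked out, so its probability underflows to zero and the remaining mass is shared between the two valid actions.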
-
### Description
The master equation for open quantum systems has the Schrödinger portion of the equation plus a non-unitary thermal dissipation part. The objective here is to come up with an implemen…
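
For reference, and assuming the description means the standard Lindblad (GKSL) form of the master equation, the two parts could be written as follows. The symbols are not from the original description: $\rho$ is the density matrix, $H$ the system Hamiltonian, and $L_k$ the jump operators with rates $\gamma_k$.

$$
\frac{d\rho}{dt} = -\frac{i}{\hbar}\,[H, \rho] + \sum_k \gamma_k \left( L_k \rho L_k^\dagger - \frac{1}{2} \left\{ L_k^\dagger L_k, \rho \right\} \right)
$$

The commutator term is the Schrödinger (unitary) part, and the sum is the non-unitary dissipation part.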
-
Example: https://mila.quebec/en/publications/
It would be nice to reuse the same code as on the Mila website. I'm not sure whether that's 'easily' possible via RTD.
-
-
Hi @ChanganVR
When I run:
`python ss_baselines/av_nav/run.py --exp-config ss_baselines/av_nav/config/audionav/replica/train_telephone/audiogoal_depth_ddppo.yaml \
--model-dir data/models/ss2/repli…
-
Hi,
I used your implementation for my research project and would like to credit the work. Is it possible to add a BibTeX entry to the readme.md?
-
-
I recently read [this](https://arxiv.org/pdf/1707.06347.pdf) paper on Proximal Policy Optimization, which is being used by [OpenAI Five](https://openai.com/blog/openai-five/) to beat competitive DotA …
-
## Question
### What's your question?
I am currently using the `moveByAngleRatesZAsync()` API command, and I notice that after each call completes with `.join()`, the drone automatically…