-
When pasting the arxiv email, I get 'This is not an arxiv email!'
```
------------------------------------------------------------------------------
------------------------------------------------…
-
### What is your question?
My goal is to learn a single policy that is deployed to multiple agents (i.e. all agents learn the same policy, but are able to communicate with each other through a shar…
-
**Submitting author:** @GurjeetSinghSangra (Gurjeet Singh)
**Repository:** https://github.com/acerbilab/pybads
**Branch with paper.md** (empty if default branch): joss-submission
**Version:** 1.0.4
**…
-
Hello, thanks for your excellent work. For the IPO algorithm, I wonder that if it is suit for the environment whose action space is discrete?
Because of that the interior point algorithm is not suit …
-
**Submitting author:** @GurjeetSinghSangra (Gurjeet Singh)
**Repository:** https://github.com/acerbilab/pybads
**Branch with paper.md** (empty if default branch): joss-submission
**Version:** v1.0.0
*…
-
Can you tell me what exactly is alignment in AI research?
-
Hi, this is an amazing piece of work, we study reinforcement learning is often highly constrained from the environment, I wonder if it supports multi-agent collaboration? It would be perfect if it cou…
-
https://github.com/orgs/paritytech/projects/68
Collators in Polkadot are associated with some particular parachain and are responsible for creating blocks for that parachain as well as Proofs-of-Va…
-
Hi,
Great work.
Wondering are you open to add other pushed offline method?
Here is the paper: Continuous Doubly Constrained Batch Reinforcement Learning, Neurips 2021
https://proceedings.neurip…
-
The README file contains a figure of a hierarchical DDM model with trial-to-trial variability in the drift rate, on top of the the mean drift-rate, threshold and non-decision time parameters. Although…