constrained-reinforcement-learning Search Results

107 results
for constrained-reinforcement-learning

Best match

deragent/arXivFilter #7

Does not parse arXiv emails with "empty" footer

When pasting the arxiv email, I get 'This is not an arxiv email!' ``` ------------------------------------------------------------------------------ ------------------------------------------------…

louiskirsch updated 1 year ago · 7 comments

ray-project/ray #7341

[rllib] Custom model for multi-agent environment: access to …

### What is your question? My goal is to learn a single policy that is deployed to multiple agents (i.e. all agents learn the same policy, but are able to communicate with each other through a shar…

janblumenkamp updated 4 months ago · 54 comments

openjournals/joss-reviews #5694

[REVIEW]: PyBADS: Fast and robust black-box optimization in …

**Submitting author:** @GurjeetSinghSangra (Gurjeet Singh) **Repository:** https://github.com/acerbilab/pybads **Branch with paper.md** (empty if default branch): joss-submission **Version:** 1.0.4 **…

editorialbot updated 8 months ago · 115 comments

PKU-Alignment/Safe-Policy-Optimization #23

Something about IPO

Hello, thanks for your excellent work. For the IPO algorithm, I wonder whether it is suitable for environments whose action space is discrete, because the interior point algorithm is not suit …

stvsd1314 updated 1 year ago · 3 comments

openjournals/joss-reviews #5544

[PRE REVIEW]: PyBADS: Fast and robust black-box optimization…

**Submitting author:** @GurjeetSinghSangra (Gurjeet Singh) **Repository:** https://github.com/acerbilab/pybads **Branch with paper.md** (empty if default branch): joss-submission **Version:** v1.0.0 *…

editorialbot updated 1 year ago · 31 comments

second-state/chat-with-chatgpt #179

Alignment research

Can you tell me what exactly is alignment in AI research?

juntao updated 1 year ago · 15 comments

Avalon-Benchmark/avalon #9

Can it run multiple agents?

Hi, this is an amazing piece of work. The reinforcement learning we study is often highly constrained by the environment; I wonder if it supports multi-agent collaboration? It would be perfect if it cou…

98luobo updated 2 years ago · 1 comment

paritytech/polkadot #2888

Collator Protocol Revamp

https://github.com/orgs/paritytech/projects/68 Collators in Polkadot are associated with some particular parachain and are responsible for creating blocks for that parachain as well as Proofs-of-Va…

rphmeier updated 1 year ago · 13 comments

tinkoff-ai/CORL #18

adding other published offline method

Hi, great work. I am wondering whether you are open to adding other published offline methods? Here is the paper: Continuous Doubly Constrained Batch Reinforcement Learning, NeurIPS 2021 https://proceedings.neurip…

rasoolfa updated 2 years ago · 1 comment

mar-wir/StanDDM #2

Issues in the graphical representation of the DDM in the REA…

The README file contains a figure of a hierarchical DDM model with trial-to-trial variability in the drift rate, on top of the mean drift-rate, threshold and non-decision time parameters. Although…

dkatsimpokis updated 1 year ago · 3 comments
