# Actor-Critic Algorithms #
- Authors: Vijay R. Konda, John N. Tsitsiklis
- Origin: https://papers.nips.cc/paper/1786-actor-critic-algorithms.pdf
- Related:
- PyTorch4 tutorial of: actor critic…
-
## Abstract
- Presents an approach to training neural networks to generate sequences using the actor-critic method from reinforcement learning
- Introduces a **critic** network that is trained to predict the value of an output token, given the policy of …
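
To make the actor/critic split concrete, here is a minimal sketch of a one-step actor-critic update. It is not the method of the paper or of any linked tutorial: it assumes a tabular softmax actor and a tabular state-value critic, and every name in it (`actor_critic_step`, `logits`, `values`, the learning rates) is hypothetical. In a sequence-generation reading, a "state" would stand for the prefix generated so far and an "action" for the next token.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def actor_critic_step(logits, values, s, a, r, s_next, done,
                      gamma=0.99, actor_lr=0.1, critic_lr=0.1):
    """Apply one one-step actor-critic update in place (illustrative only).

    logits[s][a] holds the actor's preference for action a in state s;
    values[s] holds the critic's value estimate for state s.
    Returns the TD error that drives both updates.
    """
    # Critic target: reward plus discounted value of the next state
    # (no bootstrapping when the episode has ended).
    target = r + (0.0 if done else gamma * values[s_next])
    td_error = target - values[s]

    # Policy probabilities under the current (pre-update) actor.
    probs = softmax(logits[s])

    # Critic update: move the value estimate toward the target.
    values[s] += critic_lr * td_error

    # Actor update: for a softmax policy, the gradient of log pi(a|s)
    # with respect to logits[s][b] is 1[b == a] - pi(b|s).
    for b in range(len(logits[s])):
        grad_log_pi = (1.0 if b == a else 0.0) - probs[b]
        logits[s][b] += actor_lr * td_error * grad_log_pi

    return td_error


# Toy usage: two states, two actions, one rewarded terminal transition.
logits = [[0.0, 0.0], [0.0, 0.0]]
values = [0.0, 0.0]
delta = actor_critic_step(logits, values, s=0, a=1, r=1.0, s_next=1, done=True)
new_probs = softmax(logits[0])
```

After this single rewarded transition the critic's estimate for state 0 rises and the actor puts more probability on the action that was taken. The critic's TD error plays the role of the learning signal for the actor, which is the core idea the notes above refer to.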
-
### What happened + What you expected to happen
# What happened
Calling `Algorithm.add_module` with a `module_state` does not use the module state, but instead loads or builds the module directly…
-
### Please describe the purpose of the feature. Is it related to a problem?
I am inquiring about possibly integrating JAX-based Graph Neural Networks (GNNs) into MAVA for use in MARL. Many MARL algor…
-
Hi,
Is there any support for ACER, the off-policy counterpart of A2C, or could it be built on top of this repo?
This is a very useful repo that we mostly use, and also nice to have its comp…
-
### ❓ Question
I want to modify the network structure for RecurrentPPO, but when I run the original network, I get the following error:
self.features_extractor = features_extractor_class(se…
-
Baseline run of ReBRAC on halfcheetah-medium-v2
https://wandb.ai/jnqian/CORL/runs/a4876f1d-be93-4616-b5d8-2ec84a1a9f5a