-
# Actor-Critic Algorithms #
- Authors: Vijay R. Konda, John N. Tsitsiklis
- Origin: https://papers.nips.cc/paper/1786-actor-critic-algorithms.pdf
- Related:
- PyTorch4 tutorial of: actor critic…
-
Hi,
I would like to ask whether there is a JAX-based implementation,
and whether you have any recommendations for JAX-based offline RL algorithms.
Thanks!
-
In a multi-agent setting, when training e.g. `MAPPO_Agents()`, then calling `MAPPO_Agents.save_model(model_name='model.pth')` and finally loading the model `MAPPO_Agents.load_model(path)`, how can I e…
-
The paper "gCastle: A Python Toolbox for Causal Discovery" claims that "gCastle includes ... with **optional GPU acceleration**". However, I don't know how GPU acceleration can be used on this package…
-
**Is your feature request related to a problem? Please describe.**
Model-based offline RL algorithms which are able to handle image inputs are necessary for some environments.
**Describe the solut…
-
Based on the available expert controller design examples (PI-based inner current/voltage control + droop control for power sharing), it would be very interesting to highlight the shortcomings and advan…
-
min_b ||Ab - y||
b = A^+ y
where A^+ is the [pseudoinverse](https://en.wikipedia.org/wiki/Moore%E2%80%93Penrose_inverse) of A
See *Introduction to algorithms* by *TH Cormen, CE Leiserson, RL Rivest, …
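
A minimal sketch of the least-squares solution above (minimizing the norm of Ab - y via b = A^+ y), assuming NumPy is available. The matrix `A` and vector `y` are made-up illustration data, not taken from the issue; `np.linalg.lstsq` is included only as a cross-check.

```python
import numpy as np

# Hypothetical data: 5 observations of the line y = 1 + 2x (no noise).
# The first column of A is the intercept term, the second is x.
A = np.array([
    [1.0, 0.0],
    [1.0, 1.0],
    [1.0, 2.0],
    [1.0, 3.0],
    [1.0, 4.0],
])
y = np.array([1.0, 3.0, 5.0, 7.0, 9.0])

# b = A^+ y, where A^+ is the Moore-Penrose pseudoinverse of A.
b_pinv = np.linalg.pinv(A) @ y

# Cross-check with NumPy's dedicated least-squares solver.
b_lstsq, *_ = np.linalg.lstsq(A, y, rcond=None)

# Residual norm ||Ab - y|| at the pseudoinverse solution.
residual = np.linalg.norm(A @ b_pinv - y)
```

In practice `np.linalg.lstsq` is usually preferred over forming the pseudoinverse explicitly, since it avoids building A^+ as a separate matrix.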
-
### What happened + What you expected to happen
code:
`from ray.rllib.algorithms.dreamerv3 import DreamerV3Config
config = (
DreamerV3Config()
.environment("CartPole-v1")
.training…
-
### What happened + What you expected to happen
Using the new V2 API stack raises a `NotImplementedError` in `BatchIndividualItems(ConnectorV2)`
### Versions / Dependencies
Google Colab, Pyth…
-
Can you explain the principles behind the code design, especially how the DDPG algorithm relates to portfolios?