rl-algorithms Search Results

1000+ results for rl-algorithms

number9473/nn-algorithm #247

Actor-Critic Algorithms

# Actor-Critic Algorithms # - Author: Vijay R. Konda, John N. Tsitsiklis - Origin: https://papers.nips.cc/paper/1786-actor-critic-algorithms.pdf - Related: - PyTorch4 tutorial of: actor critic…

joyhuang9473 updated 6 years ago
2
google-research/batch_rl #30

JAX code

Hi, I would like to ask whether there is a jax-based code. And whether there are some recommendations about jax-based offline rl algorithms. Thanks!

lucasliunju updated 1 year ago
11
agi-brain/xuance #49

How to export or load multi-agent policies?

In a multi-agent setting, when training e.g. `MAPPO_Agents()`, then calling `MAPPO_Agents.save_model(model_name='model.pth')` and finally loading the model `MAPPO_Agents.load_model(path)`, how can I e…

ardian-selmonaj updated 1 month ago
6
huawei-noah/trustworthyAI #140

Running on GPU

The paper "gCastle: A Python Toolbox for Causal Discovery" claims that "gCastle includes ... with **optional GPU acceleration**". However, I don't know how GPU acceleration can be used on this package…

zhj2022 updated 9 months ago
5
takuseno/d3rlpy #58

[REQUEST] Adding model-based offline RL with image inputs li…

**Is your feature request related to a problem? Please describe.** Model-based offline RL algorithms which are able to handle image inputs are necessary for some environments. **Describe the solut…

kargarisaac updated 3 years ago
8
upb-lea/openmodelica-microgrid-gym #51

Add examples highlighting the learning process with state-of…

Based on the available expert controller design examples (PI-based inner current/voltage control + droop control for power sharing) it will be very interesting to highlight the shortcomings and adavan…

wallscheid updated 3 years ago
2
make-github-pseudonymous-again/js-algorithms #56

Least-squares approximation

$\min_b \lVert Ab - y \rVert$, with solution $b = A^+ y$, where $A^+$ is the [pseudoinverse](https://en.wikipedia.org/wiki/Moore%E2%80%93Penrose_inverse) of $A$. See *Introduction to algorithms* by *TH Cormen, CE Leiserson, RL Rivest, …

make-github-pseudonymous-again updated 4 years ago
1
ray-project/ray #42959

dreamerv3 failed at config.build()

### What happened + What you expected to happen code: `from ray.rllib.algorithms.dreamerv3 import DreamerV3Config config = ( DreamerV3Config() .environment("CartPole-v1") .training…

lailing2000 updated 2 months ago
3
ray-project/ray #45464

[<Ray component: RLlib] enable_env_runner_and_connector_v2 N…

### What happened + What you expected to happen Using the new V2 API stack raises a `NotImplementedError` in `BatchIndividualItems(ConnectorV2)` ### Versions / Dependencies Google Colab, Pyth…

timborden updated 3 months ago
2
matinaghaei/Portfolio-Management-ActorCriticRL #1

Ask Code Principle

Can you explain the principles of code design, especially how the ddpg algorithm relates to portfolios

SuGuilin updated 3 months ago
1
