-
On executing trpo_continous.py, I get the following error:
> [2017-07-01 23:52:58,375] Making new env: CartPole-v0
> [TL] InputLayer continous_shared/continous_input_layer: (?, 3)
> [TL…
-
In train.py, I see a central agent, an SL agent, and RL agents. They run on different CPU cores via the multiprocessing package, and the RL agents get the weights of the policy and value networks from the central …
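For context, the pattern described (a central process serving weights to RL worker processes over `multiprocessing`) can be sketched roughly as below. This is a minimal illustration under my own assumptions, not the actual train.py code; all function and message names here are hypothetical.

```python
# Hypothetical sketch of the weight-sharing pattern: a central process
# answers weight requests from worker (RL agent) processes over Pipes.
# Names are illustrative only, not the actual train.py API.
import multiprocessing as mp

def rl_agent(conn, agent_id):
    # Worker: request the latest policy/value weights, then report done.
    conn.send(("get_weights", agent_id))
    weights = conn.recv()
    # ... here the worker would run rollouts using `weights` ...
    conn.send(("done", agent_id))
    conn.close()

def central_agent(conns, weights):
    # Central process: serve weight requests until every worker finishes.
    pending = set(range(len(conns)))
    while pending:
        for i, conn in enumerate(conns):
            if i in pending and conn.poll(0.1):
                msg, agent_id = conn.recv()
                if msg == "get_weights":
                    conn.send(weights)
                elif msg == "done":
                    pending.discard(agent_id)

if __name__ == "__main__":
    weights = {"policy": [0.1, 0.2], "value": [0.3]}  # placeholder weights
    parent_conns, procs = [], []
    for i in range(2):
        parent, child = mp.Pipe()
        p = mp.Process(target=rl_agent, args=(child, i))
        p.start()
        parent_conns.append(parent)
        procs.append(p)
    central_agent(parent_conns, weights)
    for p in procs:
        p.join()
    print("all agents synced")
```

In the real code the workers would also push gradients or experience back to the central agent, but the request/response weight pull is the core of the pattern.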
-
Hello, I'm trying to port the MADDPG RL algorithm to smaug. Is there any documentation I can follow for integrating a new algorithm?
Thanks
-
Hi,
I would like to ask whether there is JAX-based code available,
and whether you have any recommendations for JAX-based offline RL algorithms.
Thanks!
-
### What happened + What you expected to happen
I am having issues loading a DreamerV3 checkpoint for inference. Similar to what was discussed in #40312, I assume it has to do with the old/new API.
…
-
I am interested in using Flow for VANETs (Vehicular Ad hoc NETworks) routing protocols, which play a key role in the design and development of Intelligent Transportation Systems. Besides RL,
genetic…
-
# What has been done:
- Machine Learning course:
  - Week 5 ✔️
  - Week 6 ⌚
- EURO2022 - abstract ⌚
- Reinforcement Learning:
  - Restructure code
  - Plot some form of mean policy next to…
-
The paper "gCastle: A Python Toolbox for Causal Discovery" claims that "gCastle includes ... with **optional GPU acceleration**". However, I can't find how GPU acceleration is enabled with this package…
-
# What has been done:
- Gregynog ✔️
- Booking flights/hotels/registrations for conferences ⌚
- Julia course ✔️
- Fixed Reinforcement Learning algorithm bug ✔️
- Output of RL algorithm 💯🎉
# To …
-
My understanding is that most RL libraries will focus on supporting Gymnasium going forward and that it will become the standard. Trying to get Ray RLlib or other RL libraries working with gym environments is prett…
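The core of the gym → Gymnasium migration is the API change: `reset()` now returns `(obs, info)` and `step()` returns a 5-tuple with separate `terminated`/`truncated` flags. The sketch below illustrates a minimal compatibility shim without importing either library; `DummyOldEnv` and `GymToGymnasium` are my own illustrative names, not an official wrapper.

```python
# Minimal illustration of the classic-gym vs Gymnasium API difference.
# Neither library is imported; the classes below only mirror the shapes
# of the two interfaces.

class DummyOldEnv:
    """Old gym style: reset() -> obs, step() -> (obs, reward, done, info)."""
    def reset(self):
        self.t = 0
        return 0.0
    def step(self, action):
        self.t += 1
        done = self.t >= 3
        return float(self.t), 1.0, done, {}

class GymToGymnasium:
    """Adapt an old-style env to the Gymnasium-style interface:
    reset() -> (obs, info), step() -> (obs, reward, terminated, truncated, info)."""
    def __init__(self, env):
        self.env = env
    def reset(self, seed=None):
        return self.env.reset(), {}
    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        # The old API conflates time-limit truncation with termination, so
        # a naive shim has to report everything as `terminated`.
        return obs, reward, done, False, info

env = GymToGymnasium(DummyOldEnv())
obs, info = env.reset()
obs, reward, terminated, truncated, info = env.step(0)
```

In practice you would use an official compatibility layer rather than hand-rolling one, but this is the shape of the mismatch that makes mixing gym envs with Gymnasium-first libraries painful.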