-
Hello
I am amazed by your work. I am wondering if you tested the Sokoban game with standard RL methods (Q-learning, A2C, etc.), and whether you have success rates for this kind of game?
-
Hi, XTuner Team
Could you please add a citation for the source of the Ray+vLLM-based RLHF architecture, OpenRLHF, for example in the README.md file: https://github.com/InternLM/xtuner?tab=readme-ov-fi…
-
### Question
I'm looking for a solution to this error.
```
[INFO]: Base environment:
Environment device : cuda:0
Physics step-size : 0.005
Rendering step-size : 0.02
Environment …
```
-
Hi,
Thanks for your package and the accompanying article.
Unfortunately, I am not able to test your package; I receive the following error after running `python ddpg.py` in the gym_torcs directory:
…
-
First, thank you for the code accompanying the paper "Discrete and Continuous Action Representation for Practical RL in Video Games".
Second, according to your code, all of the action spaces of the environ…
-
Is there a way to save weights to a file and reload them later? For instance, in the car example, ui.cpp lets the user control the car and car.cpp appears to train it. I am assuming may…
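As a general pattern (the helper names below are my own illustration, not functions from this repository), weights can be serialized to a small binary file and read back, e.g. as `[count: uint32][count little-endian float64 values]`:

```python
import os
import struct
import tempfile

def save_weights(path, weights):
    # Binary layout: a uint32 count followed by that many float64 values.
    with open(path, "wb") as f:
        f.write(struct.pack("<I", len(weights)))
        f.write(struct.pack(f"<{len(weights)}d", *weights))

def load_weights(path):
    with open(path, "rb") as f:
        (count,) = struct.unpack("<I", f.read(4))
        return list(struct.unpack(f"<{count}d", f.read(8 * count)))

# Round-trip check
path = os.path.join(tempfile.mkdtemp(), "weights.bin")
save_weights(path, [0.1, -2.5, 3.0])
restored = load_weights(path)
```

The same fixed-width layout is easy to mirror in C++ with `fwrite`/`fread` if the car example's training code keeps its weights in a flat array.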
-
```
AGENT NAME: A3C
1.1: A3C
TITLE CartPole
layer info [20, 10, [2, 1]]
layer info [20, 10, [2, 1]]
{'learning_rate': 0.005, 'linear_hidden_units': [20, 10], 'final_layer_activation': ['SOFTMAX', …
```
-
I'm trying to differentiate the MJX step function via the autograd function `jax.grad()` in JAX, like:
```python
def step(vel, pos):
    mjx_data = mjx.make_data(mjx_model)
    mjx_data = mjx_data.replace(q…
```
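For reference, here is a minimal, self-contained sketch of the same pattern with a toy step function standing in for the MJX call (the model/data operations are replaced by plain `jax.numpy` math, since `jax.grad` requires a pure function that returns a scalar):

```python
import jax
import jax.numpy as jnp

# Toy stand-in for an MJX step; not the real mjx API.
def toy_step(vel, pos):
    dt = 0.005                    # arbitrary timestep for the toy dynamics
    new_pos = pos + dt * vel      # one explicit Euler step
    return jnp.sum(new_pos ** 2)  # reduce the new state to a scalar loss

# Gradients of the scalar output with respect to both inputs.
dvel, dpos = jax.grad(toy_step, argnums=(0, 1))(jnp.ones(3), jnp.zeros(3))
```

If the real step works the same way, the key requirements are that every operation inside `step` be traceable by JAX and that the returned value be a scalar (or that a scalar reduction be applied before `jax.grad`).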
-
## Motivation
Plan for the doc revamp
- [x] A 0-to-1 tutorial or Getting started #861
- [x] A tutorial on building a custom env #911
- [ ] A tutorial on model ensembling #876
- [x] A tutorial…
-
Instead of using different default arguments for different algorithms, create config files and load arguments from there.
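One possible shape for this (file names, keys, and values below are illustrative assumptions, not the project's actual defaults) is one JSON config file per algorithm, with explicit user arguments overriding the file's values:

```python
import json
import tempfile
from pathlib import Path

# Illustrative per-algorithm defaults.
DEFAULTS = {
    "a2c": {"learning_rate": 7e-4, "n_steps": 5},
    "ppo": {"learning_rate": 3e-4, "n_steps": 2048},
}

def write_default_configs(config_dir: Path) -> None:
    # One JSON file per algorithm instead of per-algorithm argument defaults.
    config_dir.mkdir(parents=True, exist_ok=True)
    for name, params in DEFAULTS.items():
        (config_dir / f"{name}.json").write_text(json.dumps(params, indent=2))

def load_config(config_dir: Path, algorithm: str, **overrides) -> dict:
    # File defaults are loaded first; explicit user overrides win.
    params = json.loads((config_dir / f"{algorithm}.json").read_text())
    params.update(overrides)
    return params

config_dir = Path(tempfile.mkdtemp())
write_default_configs(config_dir)
cfg = load_config(config_dir, "ppo", learning_rate=1e-4)
```

This keeps the algorithm code free of hard-coded defaults while still letting command-line arguments take precedence.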