-
I just tried the latest code and found that training speed has slowed down significantly: it used to be >200 steps_per_second, but now it's ~100 steps_per_second.
2017-09-24 15:08:08,844…
-
First, thanks for making this. It's very easy to get started with and has really helped me move things forward on a personal project of mine I've been struggling with for months. This is really awesom…
-
Notes on setting up Isaac Gym with Docker
Environment:
- Ubuntu 20.04
- NVIDIA driver version 550.54.15
- GeForce RTX 2080 Super
Setup follows this reference:
https://valinux.hatenablog.com/entry/20240111
-
### System Info
```Shell
PyTorch 2.2.1
DeepSpeed 0.13.4
```
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] One of the scripts in the examples/ …
-
Our baselines use a PPO algorithm adapted from PureJaxRL. However, it doesn't appear to follow all of the relevant implementation details from [Huang et al., 2022](https://iclr-blog-track.github.…
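One of the details Huang et al. highlight is computing advantages with GAE, bootstrapping from the next state's value and resetting at episode boundaries. A minimal NumPy sketch, for illustration only (the function name, argument shapes, and defaults are my own assumptions, not taken from either codebase):

```python
import numpy as np

def compute_gae(rewards, values, dones, gamma=0.99, lam=0.95):
    """Generalized Advantage Estimation over one rollout.

    rewards, dones: length-T sequences for the rollout.
    values: length T+1 (value of each state plus a bootstrap
            value for the state after the last step).
    """
    T = len(rewards)
    adv = np.zeros(T)
    last = 0.0
    for t in reversed(range(T)):
        # Zero out the bootstrap and the running GAE term at episode ends.
        nonterminal = 1.0 - dones[t]
        delta = rewards[t] + gamma * values[t + 1] * nonterminal - values[t]
        last = delta + gamma * lam * nonterminal * last
        adv[t] = last
    return adv
```

With `gamma=lam=1.0`, zero values, and no terminations, each advantage reduces to the sum of remaining rewards, which is a quick sanity check for the recursion.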
-
# How to recommend
We can recommend some papers for further discussion under this issue. Include a link to the paper + the conference name and other related information (like the abstract, some bas…
-
> Traceback (most recent call last):
> File "clock_gated_rnn.py", line 63, in
> model.compile(loss='binary_crossentropy', optimizer='adam', class_mode="binary")
> File "/usr/local/lib/python2…
-
**Describe the bug**
The bug occurs in Part 2 (Train the agent) while running the training cell.
The error message is `ValueError: too many values to unpack (expected 4)`, raised in the function `explore_env`. Here …
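A common cause of "too many values to unpack (expected 4)" in rollout code is the Gym to Gymnasium API change: `env.step()` now returns five values (`obs, reward, terminated, truncated, info`) instead of four. If that is what's happening in `explore_env`, a small compatibility shim can absorb both shapes (the helper name below is my own, not from this repo):

```python
def step_compat(result):
    """Normalize env.step() output across Gym versions.

    Old Gym:   (obs, reward, done, info)                       -> 4 values
    Gymnasium: (obs, reward, terminated, truncated, info)      -> 5 values
    """
    if len(result) == 5:
        obs, reward, terminated, truncated, info = result
        done = terminated or truncated
    else:
        obs, reward, done, info = result
    return obs, reward, done, info
```

Calling it as `obs, reward, done, info = step_compat(env.step(action))` keeps the downstream 4-tuple unpacking unchanged.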
-
-
### What happened + What you expected to happen
I can’t seem to replicate the original [PPO](https://arxiv.org/pdf/1707.06347) algorithm's performance when using RLlib's PPO implementation. The hyp…