-
- [ ] I have marked all applicable categories:
  - [x] exception-raising bug
  - [x] RL algorithm bug
  - [ ] documentation request (i.e. "X is missing from the documentation.")
  - [ ] ne…
-
Sorry if this appears to be a stupid question. I am trying to implement gradient inversion in PyTorch based on the paper, but I would like to ask for some clarifications. Is the inversion done on a…
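For context, assuming "gradient inversion" here refers to the inverting-gradients technique of Hausknecht & Stone for bounded action parameters, a minimal NumPy sketch might look like the following. The function name and the sign convention (positive gradient = increase the parameter) are illustrative assumptions, not part of the original question:

```python
import numpy as np

def invert_gradients(grads, params, p_min, p_max):
    """Scale gradients so bounded parameters stay inside [p_min, p_max].

    If the gradient pushes a parameter up, scale it by the remaining
    headroom (p_max - p) / (p_max - p_min); if it pushes down, scale by
    (p - p_min) / (p_max - p_min). A parameter sitting at a bound thus
    receives zero gradient in the out-of-bounds direction.
    """
    grads = np.asarray(grads, dtype=float)
    params = np.asarray(params, dtype=float)
    rng = p_max - p_min
    up = (p_max - params) / rng    # headroom above each parameter
    down = (params - p_min) / rng  # headroom below each parameter
    return np.where(grads > 0, grads * up, grads * down)
```

A parameter already at `p_max` gets a zero gradient when pushed further up, but a full-strength gradient when pushed back down.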
-
`Agent`s are entities that, at least potentially, expose a `sample_action` and an `update` method.
We exclude from the list exploration strategies and curricula.
_Implement_ means either to produce new code from the pape…
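The contract described above could be sketched as a minimal base class; everything beyond the two method names (`sample_action`, `update`) is an illustrative assumption:

```python
from abc import ABC, abstractmethod
from typing import Any

class Agent(ABC):
    """Minimal agent contract: choose actions, then learn from feedback."""

    @abstractmethod
    def sample_action(self, observation: Any) -> Any:
        """Return an action for the given observation."""

    @abstractmethod
    def update(self, transition: Any) -> None:
        """Update internal parameters from an experience tuple."""

class CyclingAgent(Agent):
    """Trivial concrete example: cycles through actions, never learns."""

    def __init__(self, actions):
        self.actions = list(actions)
        self._i = 0

    def sample_action(self, observation):
        action = self.actions[self._i % len(self.actions)]
        self._i += 1
        return action

    def update(self, transition):
        pass  # no learning in this toy example
```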
-
Hello Patrick,
I am implementing the PPO algorithm for a custom environment. I first wanted to test things out with a standard example, so I chose CartPole-v1 as implemented in [`gym…
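For reference, the core of PPO is the clipped surrogate objective, which can be sketched in a few lines of NumPy (`eps=0.2` is the commonly used default; the function name is illustrative):

```python
import numpy as np

def ppo_clip_objective(new_logp, old_logp, advantages, eps=0.2):
    """PPO clipped surrogate objective (to be maximized).

    ratio = pi_new(a|s) / pi_old(a|s), computed from log-probabilities;
    clipping keeps the ratio within [1 - eps, 1 + eps], which limits
    how far one update can move the policy away from the old one.
    """
    ratio = np.exp(new_logp - old_logp)
    unclipped = ratio * advantages
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps) * advantages
    # Taking the elementwise minimum makes the clip a pessimistic bound.
    return np.mean(np.minimum(unclipped, clipped))
```

With identical old and new log-probabilities the ratio is 1 and the objective reduces to the mean advantage; once the ratio leaves the clip range, the extra improvement is discarded.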
-
Hello,
Thanks for sharing this great Chainer-based DQN code.
I recently started using Chainer. The code works great for me, and I would like to implement an actor-critic architecture based on your …
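For reference, a one-step actor-critic update with a linear critic can be sketched as follows; all names, feature vectors, and learning rates here are illustrative and not tied to Chainer's API:

```python
import numpy as np

def actor_critic_step(theta, w, phi_s, phi_s_next, grad_logp, reward,
                      gamma=0.99, alpha_actor=0.01, alpha_critic=0.1):
    """One-step actor-critic update with a linear value function.

    Critic: V(s) = w . phi(s), updated by TD(0).
    Actor:  theta moves along grad log pi(a|s), scaled by the TD error.
    """
    td_error = reward + gamma * np.dot(w, phi_s_next) - np.dot(w, phi_s)
    w = w + alpha_critic * td_error * phi_s          # critic update
    theta = theta + alpha_actor * td_error * grad_logp  # actor update
    return theta, w, td_error
```

The TD error plays the role that the Q-target plays in DQN: it is the learning signal shared by both networks, which is why an actor-critic variant can often reuse a DQN codebase's replay and target machinery.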
-
### Question
I have tried hard to train an agent to solve any of the AntMaze environments. I tried the Stable Baselines3 implementations of SAC (dense and sparse) and PPO, but could not solve even a sm…
meppe updated 9 months ago
-
Link: [arXiv](https://arxiv.org/pdf/1906.08253.pdf)
Code: https://github.com/JannerM/mbpo
This paper is very similar to "Benchmarking Model-Based Reinforcement Learning" #5.
This paper is publis…
-
If I increase both HEIGHT and WIDTH from 5 to 10 while keeping the obstacles and the final goal at the same positions, the Deep SARSA network doesn't seem to converge. What do you think the problem is? Shoul…
-
How can I improve the success rate? My goal is to use a Baxter robot to push an object to a target point in MuJoCo. My Gym environment is complete, but the training success rate has been very low…
-
Hello, I can't find your paper. Has it been published yet? If possible, could you send a link to it? Thank you.