-
Hello, I'd like to reproduce the simulation results in Part Seven of the paper, but I'm not sure which part of the code corresponds to it. According to the instructions in the README, may I ask if t…
-
Here's an example of an intermittent printout from DDPG:
```
--------------------------------------
| reference_Q_mean | 49.8 |
| reference_Q_std | 6.61 |
| reference_action_m…
```
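For readers decoding these rows: in baselines-style DDPG, the `reference_*` statistics come from a batch drawn once from the replay buffer and then frozen, so the critic re-scores the same states every logging cycle and drift in Q is visible on fixed inputs. A minimal sketch of that bookkeeping, with `critic` and `actor` as assumed callables rather than anything taken from the log above:

```python
import numpy as np

# Sketch of baselines-style "reference" stats: the observation/action batch is
# sampled once and frozen, so these numbers track Q drift on fixed inputs.
# `critic(obs, act)` and `actor(obs)` are assumed numpy-returning callables.
def reference_stats(critic, actor, frozen_obs, frozen_actions):
    q = critic(frozen_obs, frozen_actions)   # Q(s, a) on the frozen batch
    a = actor(frozen_obs)                    # the current policy's actions
    return {
        "reference_Q_mean": float(np.mean(q)),
        "reference_Q_std": float(np.std(q)),
        "reference_action_mean": float(np.mean(a)),
    }
```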
-
The mazes used in teaching examples are all fairly small and simple. For solving a large-scale maze, say 100*100, in a fairly complex environment, which reinforcement learning algorithm should I choose? I've tried several: Q-learning seems not to find the optimal solution, and DQN trains too slowly to reach a solution at all. What might be causing these problems — the random policy selection, or some other parameter setting? Or is there a reinforcement learning algorithm better suited to this? Any guidance would be appreciated, thanks!
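For perspective, a 100*100 maze is only 10,000 discrete states, which is still well within tabular Q-learning's reach; the usual reasons it returns a sub-optimal path are epsilon decaying too quickly and a reward of zero on every non-goal step, not the algorithm itself. A minimal sketch on an open grid (walls omitted; every hyperparameter below is an illustrative assumption):

```python
import numpy as np

# Minimal tabular Q-learning on an open N x N grid (walls omitted).
N = 100                          # a 100*100 maze is only 10,000 states
GOAL = N * N - 1                 # assume the goal is the bottom-right cell
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]

def step(state, action):
    r, c = divmod(state, N)
    dr, dc = MOVES[action]
    r2 = min(max(r + dr, 0), N - 1)
    c2 = min(max(c + dc, 0), N - 1)
    s2 = r2 * N + c2
    done = (s2 == GOAL)
    return s2, (1.0 if done else -0.01), done   # step cost discourages wandering

Q = np.zeros((N * N, len(MOVES)))
alpha, gamma = 0.1, 0.99
eps, eps_min, eps_decay = 1.0, 0.05, 0.9995     # decay slowly on large mazes

for episode in range(5000):
    state, done, t = 0, False, 0
    while not done and t < 40000:               # cap episode length
        action = (np.random.randint(len(MOVES)) if np.random.rand() < eps
                  else int(np.argmax(Q[state])))
        nxt, reward, done = step(state, action)
        Q[state, action] += alpha * (reward + gamma * np.max(Q[nxt]) * (not done)
                                     - Q[state, action])
        state, t = nxt, t + 1
    eps = max(eps_min, eps * eps_decay)
```

If a tuned tabular run still fails, the difficulty is in the environment dynamics rather than the state count; at this scale DQN's function approximation buys little over a table, which is consistent with it being slow without being better.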
-
Hi @takuseno, thanks for this amazing repo — it's really helpful, and I really appreciate your efforts.
I can see that the key algorithms for discrete and continuous action spaces are covered alre…
-
While trying to reproduce the PaddlePaddle DQN tutorial from Bilibili, running the code raises an error: ImportError: cannot import name 'layers' from 'parl' (C:\Users\lenovo\anaconda3\envs\paddle_env\lib\site-packages\parl\__init__.py)
The paddle version is paddlepaddle-gpu 2.0…
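The import error is a version mismatch rather than a broken install: `parl.layers` belonged to PARL 1.x, which wrapped the old paddle.fluid API, and it was removed when PARL and PaddlePaddle 2.x moved to the dynamic-graph `paddle.nn` style. Two ways out are sketched below; the pinned versions are an assumption (check what the course materials actually specify), and the network is just an illustrative DQN-sized MLP:

```python
# Option 1: pin the old stack the course was recorded with (versions are an
# assumption; confirm against the course materials), e.g.:
#   pip install paddlepaddle==1.6.3 parl==1.3.1
#
# Option 2: stay on paddle 2.x and build the model with paddle.nn instead of
# the removed parl.layers:
import paddle
import paddle.nn as nn

class QNetwork(nn.Layer):
    """Small MLP Q-network replacing the old parl.layers.fc stack."""
    def __init__(self, obs_dim, act_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, act_dim),
        )

    def forward(self, obs):
        return self.net(obs)

q = QNetwork(obs_dim=4, act_dim=2)       # CartPole-sized, purely illustrative
values = q(paddle.randn([1, 4]))         # shape [1, 2]
```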
-
Hello Mr. Deffayet,
Thank you so much for this amazing repository! I'm currently using your project to better understand the implementation of DDPG. While working through the algorithm, I…
-
How can I improve the success rate? My goal is to use a BAXTER robot to push an object to a target point in MuJoCo. My Gym environment is finished, but its training success rate has been very low…
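A very common cause of near-zero success rates on pushing tasks is a sparse reward that a random policy almost never triggers; the usual remedies are distance-based reward shaping or hindsight experience replay (HER). A minimal shaping sketch follows; the coordinate layout, weights, and threshold are assumptions, not anything from the poster's Baxter/MuJoCo setup:

```python
import numpy as np

# Dense shaped reward for a push task: penalize gripper-to-object distance
# (reaching) and object-to-goal distance (pushing), with a success bonus.
# All constants here are illustrative assumptions.
def shaped_reward(gripper_pos, object_pos, goal_pos, success_threshold=0.05):
    reach_dist = np.linalg.norm(gripper_pos - object_pos)
    push_dist = np.linalg.norm(object_pos - goal_pos)
    reward = -0.5 * reach_dist - push_dist
    if push_dist < success_threshold:
        reward += 10.0           # bonus when the object reaches the target
    return reward

# Example call with made-up coordinates:
r = shaped_reward(np.array([0.1, 0.0, 0.2]),
                  np.array([0.3, 0.0, 0.0]),
                  np.array([0.6, 0.1, 0.0]))
```

If shaping alone is not enough, HER — which relabels failed episodes with the goals they did reach — is the standard fix for sparse-reward pushing benchmarks.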
-
Has anyone gotten DDPG with the ou_0.2 noise parameter to converge in the MountainCarContinuous-v0 environment? The rollout/return_history stays around -10 after 1 million steps. In the DDPG paper, MountainCarCo…
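For context, `ou_0.2` in the baselines naming scheme is Ornstein-Uhlenbeck action noise with sigma = 0.2, and MountainCarContinuous-v0 is an exploration-heavy task: mean-reverting noise at that scale rarely sustains the oscillation the car needs, which would explain a return_history stuck near -10 (pure action cost). A sketch of the process itself; theta and dt are the usual defaults, assumed here rather than read from the poster's config:

```python
import numpy as np

# Ornstein-Uhlenbeck action noise: dx = theta*(mu - x)*dt + sigma*sqrt(dt)*N(0,1).
# theta/dt are common defaults and an assumption, not the poster's settings.
class OUNoise:
    def __init__(self, size, mu=0.0, theta=0.15, sigma=0.2, dt=1e-2):
        self.mu, self.theta, self.sigma, self.dt = mu, theta, sigma, dt
        self.x = np.full(size, mu, dtype=np.float64)

    def reset(self):
        self.x[:] = self.mu

    def sample(self):
        dx = (self.theta * (self.mu - self.x) * self.dt
              + self.sigma * np.sqrt(self.dt) * np.random.randn(*self.x.shape))
        self.x += dx
        return self.x

noise = OUNoise(size=1)                         # 1-D action space
action = np.clip(0.0 + noise.sample(), -1.0, 1.0)
```

Raising sigma, switching to plain Gaussian or parameter-space noise, or seeding the buffer with a few successful episodes are the adjustments most often reported to get DDPG over the hill on this environment.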
-
Add multiagent presets for the [Butterfly continuous environments](https://www.pettingzoo.ml/butterfly).
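For whoever picks this up, a minimal random rollout against one Butterfly environment through PettingZoo's parallel API looks roughly like the sketch below; the `_v6` suffix and the 5-tuple `step` return match recent PettingZoo releases and are assumptions relative to whatever version the presets end up pinning:

```python
# Random-policy rollout for one Butterfly environment via the parallel API.
# Adjust the version suffix to whatever `pip show pettingzoo` reports.
from pettingzoo.butterfly import pistonball_v6

env = pistonball_v6.parallel_env(continuous=True)
observations, infos = env.reset(seed=42)
while env.agents:                                # empties when all agents finish
    actions = {agent: env.action_space(agent).sample() for agent in env.agents}
    observations, rewards, terminations, truncations, infos = env.step(actions)
env.close()
```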
-
# To Do (Urgent)
- [x] 3 types of State functions - Code Template
- [x] 3 types of Action functions - Code Template
- [x] 3 types of Reward functions - Code Template (one possible sketch follows this list)
- [x] Finish code template f…
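As a strawman for the reward-function item above, one plausible shape for a "3 types" template is a small interface with sparse, dense, and shaped variants; every class name and constant below is an assumption, not anything committed in the repo:

```python
from abc import ABC, abstractmethod
import numpy as np

# Hypothetical reward-function template: one interface, three variants.
class RewardFunction(ABC):
    @abstractmethod
    def __call__(self, state, goal):
        ...

class SparseReward(RewardFunction):
    """0 inside the success region, -1 everywhere else."""
    def __init__(self, threshold=0.05):
        self.threshold = threshold
    def __call__(self, state, goal):
        return 0.0 if np.linalg.norm(state - goal) < self.threshold else -1.0

class DenseReward(RewardFunction):
    """Negative distance to the goal."""
    def __call__(self, state, goal):
        return -float(np.linalg.norm(state - goal))

class ShapedReward(RewardFunction):
    """Dense distance term plus a bonus inside the success region."""
    def __init__(self, threshold=0.05, bonus=1.0):
        self.threshold, self.bonus = threshold, bonus
    def __call__(self, state, goal):
        d = np.linalg.norm(state - goal)
        return -float(d) + (self.bonus if d < self.threshold else 0.0)
```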