actor-critic-algorithm Search Results

766 results
for actor-critic-algorithm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

thu-ml/tianshou #169

Shared preprocess_net for actor and critic network

- [ ] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the documentation.") + [ ] ne…

wuchlei updated 4 years ago
3
hill-a/stable-baselines #509

PPO2 learning rate schedule

Hi, I have trouble finding examples using learning rate schedule with PPO2 algorithm, although it seems possible to use it : https://github.com/hill-a/stable-baselines/blob/4a5f8d886953a94e7b0a…

jviquerat updated 4 years ago
7
tensorflow/tensorflow #42693

proximal policy gradient tensorflow pendulum issue

import gym import numpy as np import tensorflow as tf class Memory(object): def __init__(self): self.ep_obs, self.ep_act, self.ep_rwd, self.ep_neglogp = [], [], [], []…

EpicSpaces updated 4 years ago
3
openai/spinningup #74

multi-cpu problem in experiment grid

@jachiam Hi! It's me again! 2 days ago I posted an issue on using multiple cpu on ExperimentGrid that seems to only give the wrong log when run in Pycharm but fine in terminal. I did some more experim…

watchernyu updated 4 years ago
6
rickstaa/LAC-TF2-TORCH-translation #9

Compare tf eager with tf Graph

# Problem description The translated code is not working in when eager execution (default in tf2) is enabled. I thas similar behaviours as the PyTorch code. I will, therefore need to compare the tw…

rickstaa updated 4 years ago
12
PaddlePaddle/PARL #332

[DDPG]PaddleCheckError 提示变量形状不匹配,具体报错如下

[07-08 00:22:31 MainThread @logger.py:224] Argv: D:/Envs/SmartCar/DDPG/train.py C:\Users\Administrator\AppData\Local\Programs\Python\Python37\lib\importlib\_bootstrap.py:219: RuntimeWarning: numpy.uf…

zbp-xxxp updated 4 years ago
4
thu-ml/tianshou #98

ActorProb has redundant activation function in SAC examples

- [ ] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the documentation.") + [ ] ne…

danagi updated 4 years ago
1
ray-project/ray #10153

`None` in the fetching list

Hi, I participate in [this challenge](https://www.aicrowd.com/challenges/neurips-2020-procgen-competition), which requires using `ray[rllib]==0.8.6`. I've implemented an algorithm and it works wit…

xlnwel updated 4 years ago
2
PaddlePaddle/PARL #363

在parl 的model 里面增加一个网络

你好我想在parl 基础上设计其他的方法其中model 里面有了 action网络和critic网络我还想再弄一个predict网络我增加后，在执行目标网络到当前网络复制函数时会报错 ![image](https://user-images.githubusercontent.com/46389180/88387895-09def400-cde6-11ea-8bbe-43303f…

kabuwaniu updated 4 years ago
5
facebookresearch/torchbeast #12

Should we update the ValueNet and PolicyNet with the differe…

In the original paper of IMPALA, the state value estimation and the action were the output of the same net, and the net was updated with the sum of three losses , which is not usual in the actor-criti…

YuhwaChoong updated 4 years ago
1

上一页 1...59 60 61 62 63 64 65...77 下一页

766 results for actor-critic-algorithm

766 results
for actor-critic-algorithm