-
Hi, it's really great that facebookresearch is considering providing a library for reinforcement learning research.
It would be very helpful if the library provided the low-level functionality rather …
-
Thanks a lot for maintaining the Rust bindings! I have built a very simple actor-critic algorithm in Rust, and it works like a charm. However, how can I emit events for TensorBoard to get fancy graphs?
…
-
Thanks for your nice work, but this is my first time using Chainer.
-
What are the caveats for serialization, and how should complex objects generally be handled? Annotating the agent directly just turns out like this.
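For context, the usual caveat with complex objects is members that cannot be pickled (file handles, locks, environment references). A minimal sketch of the standard workaround, using a hypothetical `Agent` class (not from any specific library):

```python
import os
import pickle

class Agent:
    """Toy agent holding an unpicklable member (hypothetical example)."""
    def __init__(self):
        self.steps = 0
        # File handles cannot be pickled directly.
        self.log_file = open(os.devnull, "w")

    def __getstate__(self):
        # Drop the unpicklable handle; keep only plain state.
        state = self.__dict__.copy()
        del state["log_file"]
        return state

    def __setstate__(self, state):
        # Restore plain state and recreate the handle on load.
        self.__dict__.update(state)
        self.log_file = open(os.devnull, "w")

agent = Agent()
agent.steps = 42
clone = pickle.loads(pickle.dumps(agent))
print(clone.steps)  # 42
```

The same `__getstate__`/`__setstate__` pattern covers most "complex object" cases; anything that is pure data round-trips through pickle without extra work.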
-
I want to make a Furuta pendulum, like [this](https://www.google.com/imgres?imgurl=https%3A%2F%2Fwww.researchgate.net%2Fpublication%2F227017529%2Ffigure%2Ffig1%2FAS%3A302327165669385%401449091821542%2…
-
Hello, I need to make SacAgent work with discrete actions, so I tried to implement the Gumbel-Softmax reparameterization trick by re-defining the relevant classes. However, the calculation of `agent.train(experie…
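For reference, the relaxation I am after is the standard Gumbel-Softmax: $y_i = \mathrm{softmax}((\log \pi_i + g_i)/\tau)$ with $g_i \sim \mathrm{Gumbel}(0,1)$. A minimal stdlib sketch (the function name and temperature default are my own, not from tf-agents):

```python
import math
import random

def gumbel_softmax(logits, tau=1.0, seed=0):
    """Gumbel-Softmax sample: softmax((logits + g) / tau), g ~ Gumbel(0, 1)."""
    rnd = random.Random(seed)
    # Gumbel(0, 1) noise via inverse transform: g = -log(-log(u)).
    g = [-math.log(-math.log(rnd.uniform(1e-9, 1.0))) for _ in logits]
    z = [(l + gi) / tau for l, gi in zip(logits, g)]
    m = max(z)                              # subtract max for numerical stability
    e = [math.exp(zi - m) for zi in z]
    s = sum(e)
    return [ei / s for ei in e]

probs = gumbel_softmax([1.0, 2.0, 3.0], tau=0.5)
```

Lowering `tau` pushes the output toward a one-hot vector (a hard discrete choice), while larger `tau` keeps it smooth and well-behaved for gradients.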
-
Hi,
I have been training a custom robot based on the a1 example. I repeatedly get the following error a random number of seconds into training:
```
Traceback (most recent call last):
…
-
This issue grew out of the discussion in
https://github.com/thu-ml/tianshou/pull/950#discussion_r1342174137_
## Summary
Currently, all policies take actor/critic/critic2 optimizers t…
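One possible direction (a sketch only; all names here are hypothetical, not the actual tianshou API): have policies accept an optimizer *factory* instead of constructed optimizers, so that the policy, which owns the parameters, is the one that pairs them with an optimizer:

```python
class OptimizerFactory:
    """Builds an optimizer once the policy knows its parameters (hypothetical API)."""
    def __init__(self, optim_class, **kwargs):
        self.optim_class = optim_class
        self.kwargs = kwargs

    def create(self, params):
        return self.optim_class(params, **self.kwargs)

class SGD:
    """Stand-in optimizer so the sketch runs without torch installed."""
    def __init__(self, params, lr=0.01):
        self.params = list(params)
        self.lr = lr

class Policy:
    def __init__(self, actor_params, optim_factory):
        # The policy, not the caller, binds parameters to the optimizer.
        self.optim = optim_factory.create(actor_params)

policy = Policy([1.0, 2.0], OptimizerFactory(SGD, lr=0.001))
```

This removes the caller's burden of constructing actor/critic/critic2 optimizers against parameters it does not own, and lets the policy create as many optimizers as it internally needs.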
-
Hello!
I noticed that the maximum number of episodes can be controlled by MAX_EPISODES during training, and EVAL_INTERVAL determines the evaluation interval; however, the evaluation process seems to determi…
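To make the question concrete, this is the kind of loop I have in mind (the constants mirror the names above; everything else is a sketch, not the library's actual code):

```python
MAX_EPISODES = 10    # total training episodes
EVAL_INTERVAL = 3    # evaluate every N training episodes

def train_episode(ep):
    return f"train {ep}"      # placeholder for one training episode

def evaluate():
    return "eval"             # placeholder for one evaluation pass

log = []
for ep in range(1, MAX_EPISODES + 1):
    log.append(train_episode(ep))
    if ep % EVAL_INTERVAL == 0:
        # My question: what controls how many episodes run inside here?
        log.append(evaluate())

# With the settings above, evaluation triggers at episodes 3, 6, and 9.
```

So the interval controls *when* evaluation happens, but I cannot find what controls *how long* each evaluation runs.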
-
I attempted to run softlearning with mujoco210, but it was unsuccessful. Is there currently an incompatibility between softlearning and mujoco210?