td3-pytorch Search Results

146 results
for td3-pytorch

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ray-project/ray #10653

[rllib] DDPG TWIN_Q LOSS ERROR

The TD3 loss according to OpenAI Spinning up is: ![L1](https://spinningup.openai.com/en/latest/_images/math/7d5c18f49a242cc3eec554f717fe4f3bfc119bab.svg) ![L2](https://spinningup.openai.com/en/lat…

mvindiola1 updated 4 years ago
1
thu-ml/tianshou #29

potential bug in the implementation of DDPG

- [X] I have marked all applicable categories: + [X] exception-raising bug + [X] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the documentation.") + [ ] ne…

lmzhangucsd updated 3 years ago
18
DLR-RM/rl-baselines3-zoo #63

Several trials with the same value [question]

Good day. I'm trying zoo now on a custom environment, and I'm getting a couple of questions. - There are many trials that finished with the exact same value, and there's more than 1 instance of tha…

rick-oster updated 3 years ago
2
DLR-RM/stable-baselines3 #93

[enhancement] Polyak Averaging could be done faster

This is rather minor, but polyak averaging in DQN/SAC/TD3 could be done faster with far fewer intermediate tensors using `torch.addcmul_` https://pytorch.org/docs/stable/torch.html#torch.addcmul.

m-rph updated 4 years ago
5
hill-a/stable-baselines #1005

[question] Remove bias from neural network model.

Hello, I'm using TD3 for training a MLP policy in a custom environment and I would like to know what is the appropriate way to remove the bias from the neural network model, since I would like to h…

wilsonsamarques updated 4 years ago
10
DLR-RM/rl-baselines3-zoo #18

[question] Cannot enjoy the trained agents.

After cloning the rl-baselines3-zoo, I was trying to train my own agent. By : **python train.py --algo algo_name --env env_id** After that, I used **python enjoy.py --algo td3 --env AntBulletEnv-v…

Litao917 updated 4 years ago
17
tensorflow/tensorflow #25349

Library Conversion: Open AI Baselines

**[OpenAI Baselines](https://github.com/openai/baselines)** is a set of high-quality implementations of reinforcement learning algorithms. These algorithms make it easier for the research community to…

dynamicwebpaige updated 4 years ago
32
hill-a/stable-baselines #878

Cannot enjoy the trained agents when using stable-baselines3…

After cloning the rl-baselines3-zoo, I was trying to train my own agent. By : **python train.py --algo algo_name --env env_id** After that, I used **python enjoy.py --algo algo_name --env env_id…

Litao917 updated 4 years ago
2
hill-a/stable-baselines #845

Continual learning becomes really slow.

Hi, I am using following code to resume an interrupted training using TD3.load(). However, the training speed is much slower than before. Here environment is the same as before. Each episode (with th…

blurLake updated 4 years ago
4
DLR-RM/stable-baselines3 #27

Add support for pretraining [feature request]

First: I'm very happy to see the new PyTorch SB3 version! Great job! My question is whether pretraining-support is planned for SB3 (like for SB: https://stable-baselines.readthedocs.io/en/master/g…

skervim updated 4 years ago
17

上一页 1...9 10 11 12 13 14 15...15 下一页

146 results for td3-pytorch

146 results
for td3-pytorch