-
Hi, thank you for your work, it's amazing! I'm a student who just started learning DRL. I set up the simulation environment according to the tutorial and used your original program to train (by executing 'pyth…
-
### 🚀 Feature
Stochastic Weight Averaging (SWA) is a recently proposed technique that can potentially help improve training stability in DRL. There is now a new implementation in `torchcontrib`. Quoting/p…
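For context, the core of SWA is just a running average of weight snapshots taken along the training trajectory (which is what `torchcontrib`'s SWA optimizer maintains internally). A minimal, purely illustrative sketch with plain Python lists instead of torch tensors:

```python
# Sketch of the SWA running average; names and values here are illustrative,
# not taken from torchcontrib's actual implementation.
def swa_update(avg_weights, new_weights, n_averaged):
    """Incremental mean: avg <- (avg * n + new) / (n + 1)."""
    return [(a * n_averaged + w) / (n_averaged + 1)
            for a, w in zip(avg_weights, new_weights)]

# Average two weight snapshots taken at different training steps.
avg = [0.0, 0.0]
for n, snapshot in enumerate([[1.0, 2.0], [3.0, 4.0]]):
    avg = swa_update(avg, snapshot, n)
# avg is now the element-wise mean of the snapshots: [2.0, 3.0]
```

The averaged weights are used only for evaluation; training continues on the live weights, which is why SWA tends to smooth out the noise in the policy without changing the optimizer's trajectory.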
-
Excuse me, is there any method that does not require network-mode training?
I ask because the network communication time may affect the execution speed of each step in RL and thus affect the traini…
-
Hey Jingliang, I just saw your paper for the first version of DSAC.
It is impressive and clear. I am quite interested in your implementation of TD4.
Would you …
-
Sorry, I'm here to ask a question again.
I'm trying to execute GDAM.py, but I can't find the file:
`OSError: File /home/wenzhi/GDAE/Code/assets/launch/multi_robot_scenario.launch does not exist`
![2023-0…
-
On running the lagrangian version of SAC I get the following curve for costs. I tried changing the constraint limits to a range of values and didn't get much benefit:
![lagrangian_sac_pointgoal1](h…
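For reference, the dual update behind Lagrangian-constrained SAC is typically plain gradient ascent on the multiplier, clipped at zero; if the multiplier never moves, the constraint has no effect on the policy loss. A hedged sketch (the function name, learning rate, and cost limit are illustrative, not this repo's values):

```python
# Illustrative dual-variable (Lagrange multiplier) update for a cost constraint.
def update_lagrange_multiplier(lam, episode_cost, cost_limit, lr=0.01):
    """Ascend on the constraint violation; clip so the penalty stays >= 0."""
    return max(0.0, lam + lr * (episode_cost - cost_limit))

lam = 0.0
for cost in [30.0, 28.0, 20.0, 10.0]:  # episode costs against a limit of 25
    lam = update_lagrange_multiplier(lam, cost, cost_limit=25.0)
# lam grows while cost > limit and decays back to zero once cost < limit
```

One thing worth checking when the cost curve looks insensitive to the limit: whether the multiplier's learning rate is large enough for it to react within your training budget, since a too-small dual step size makes every `cost_limit` look the same.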
-
The SAC algorithm in the PyTorch implementation has a serious bug:
`q_params = itertools.chain(ac.q1.parameters(), ac.q2.parameters())`
An `itertools.chain` object is a one-shot iterator and will be empty after the first full pass, so e…
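A minimal reproduction of the iterator-exhaustion behavior and the usual fix (materializing the chain into a list so it can be iterated repeatedly):

```python
import itertools

# itertools.chain is a one-shot iterator, so a second pass over q_params
# (e.g. toggling requires_grad on the Q-networks) sees nothing.
a = [1, 2]  # stand-ins for ac.q1.parameters()
b = [3, 4]  # stand-ins for ac.q2.parameters()

q_params = itertools.chain(a, b)
first_pass = list(q_params)   # consumes the iterator: [1, 2, 3, 4]
second_pass = list(q_params)  # []: the iterator is now exhausted

# The usual fix: materialize the chain once into a list (or tuple).
q_params_fixed = list(itertools.chain(a, b))
```

Note that a single pass (such as constructing the optimizer, which copies the parameters into its param groups) still works with the raw chain; the bug only bites code that iterates `q_params` more than once.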
-
Implement the best practices from the multi-agent RL community and Stable-Baselines3 into our algorithm. Further analyse similarities between the PettingZoo multi-agent implementation and the current RL implementa…
-
I've tried to make the environment work with the baselines fork stable_baselines (https://github.com/hill-a/stable-baselines). It runs, but the results shown when running plot_energyplus are always…
-
For those not using the double policy, please take a look: does your code converge? I've heard from others that applying tanh directly to samples drawn from the distribution affects the entropy calculation, but I don't know why. May I ask about this?
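The entropy issue you mention comes from the change-of-variables correction: when a = tanh(u), the density of a is not the density of u, and the squashed log-prob must subtract the log-Jacobian log(1 - tanh(u)^2). Dropping that term biases the entropy estimate. A hedged sketch for a 1-D Gaussian (names are illustrative, not from this repository):

```python
import math

def gaussian_logpdf(u, mean=0.0, std=1.0):
    """Log-density of the pre-squash Gaussian sample u."""
    return -0.5 * ((u - mean) / std) ** 2 - math.log(std * math.sqrt(2 * math.pi))

def squashed_logprob(u, mean=0.0, std=1.0):
    """Log-density of a = tanh(u), including the Jacobian correction term.
    Sampling tanh(u) but using the raw Gaussian log-prob (i.e. dropping
    this term) is what biases the entropy computation."""
    return gaussian_logpdf(u, mean, std) - math.log(1.0 - math.tanh(u) ** 2)
```

At u = 0 the correction vanishes since tanh'(0) = 1; everywhere else it raises the log-prob, so omitting it systematically overestimates the policy entropy.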