-
This issue serves to announce and discuss the next major release of Tianshou, after which the library can be considered mature and stable from our perspective. The progress and the related …
-
First of all, thank you for open-sourcing this algorithm.
I am trying to train a quadruped robot locomotion policy with multimodal input, including egocentric depth vision, on complex terrain. Howev…
-
# 🚀 Feature Request
Part of #42. Depends on #44. Once an environment is set up, it will be easy to train several of the RL algorithms provided by `pytorch`. All of these algorithms should b…
-
**Description**:
The RL and IRL algorithms need tuning to perform well (especially the adversarial ones). We need to put in some time to tune them and see whether they can perform well if we want to use the…
-
- [X] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [ ] RL algorithm bug
+ [ ] system worker bug
+ [ ] system utils bug
+ [X] code design/refactor
…
-
Hello,
Thanks for this great library.
I have a question. I want to use RL2, specifically the RL2TRPO algorithm, with a discrete action space. However, it seems that the current implementation do…
-
I have been using these two routines to figure out the best learning rate to apply, with awesome results on SAC. However, changes in the `temperature` alter those values along the way. Probably wou…
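For context, SAC typically adjusts its entropy temperature automatically during training, which is one reason a learning rate found early on can drift out of tune later. Below is a minimal plain-Python sketch of that update rule, with illustrative names (`update_log_alpha`, `target_entropy`); real implementations optimize `log_alpha` with a full gradient-based optimizer over batches:

```python
import math

def update_log_alpha(log_alpha, log_probs, target_entropy, lr=1e-3):
    """One gradient-descent step on the SAC temperature loss
    L(log_alpha) = -log_alpha * mean(log_pi + target_entropy).

    dL/dlog_alpha = -mean(log_pi + target_entropy), so descending the loss
    raises alpha when policy entropy (-mean(log_pi)) falls below the target
    and lowers it when entropy is above the target.
    """
    grad = -(sum(lp + target_entropy for lp in log_probs) / len(log_probs))
    return log_alpha - lr * grad

# Policy too deterministic (entropy -2.0 below target -1.0) -> alpha grows.
print(math.exp(update_log_alpha(0.0, log_probs=[1.5, 2.5], target_entropy=-1.0)))
# Policy too random (entropy 2.0 above target -1.0) -> alpha shrinks.
print(math.exp(update_log_alpha(0.0, log_probs=[-2.0], target_entropy=-1.0)))
```

Because `alpha` scales the entropy bonus inside both the actor and critic losses, any learning rate tuned at one temperature is effectively tuned for a different objective once `alpha` has moved.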
-
### What happened + What you expected to happen
To the best of my knowledge, the repro script matches the version of the `action_masking_rlm.py` example file that shipped in release 2.34. However, I adj…
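For reference, action masking itself boils down to pushing the logits of invalid actions to negative infinity before sampling, so they receive zero probability. A minimal standalone sketch of that idea (plain Python with illustrative names; the actual RLlib example wires this into an RLModule rather than a free function):

```python
import math

def mask_logits(logits, mask):
    """Replace logits of invalid actions (mask == 0) with -inf."""
    return [l if m else float("-inf") for l, m in zip(logits, mask)]

def softmax(logits):
    """Numerically stable softmax that maps -inf logits to probability 0."""
    mx = max(l for l in logits if l != float("-inf"))
    exps = [math.exp(l - mx) if l != float("-inf") else 0.0 for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Action 1 is invalid this step, so it can never be sampled.
probs = softmax(mask_logits([2.0, 1.0, 0.5], mask=[1, 0, 1]))
print(probs[1])  # 0.0
```

Masking at the logit level (rather than zeroing probabilities after softmax) keeps the distribution properly normalized and keeps gradients well-defined for the remaining actions.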
-
- [ ] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [ ] RL algorithm bug
+ [ ] documentation request (i.e. "X is missing from the documentation.")
+ [ ] ne…
-
### What happened + What you expected to happen
When using the `on_episode_end` callback, the environment is reset before the callback is called. This means that accessing internal environment vari…
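A common workaround for this ordering is to snapshot whatever you need from the environment on every step, so the episode-end hook reads the cached copy rather than the already-reset env. A minimal standalone sketch of the pattern (plain Python, no RLlib imports; `user_data` mimics RLlib's per-episode scratch dict, and `ToyEnv`/`internal_score` are illustrative names):

```python
class EpisodeCache:
    """Per-episode scratch space, analogous to RLlib's episode.user_data."""
    def __init__(self):
        self.user_data = {}

def on_episode_step(episode, env):
    # Snapshot internal env state on every step, while it is still valid.
    episode.user_data["last_score"] = env.internal_score

def on_episode_end(episode, env):
    # The env may already be reset here, so read the cached value instead.
    return episode.user_data["last_score"]

class ToyEnv:
    def __init__(self):
        self.internal_score = 0
    def step(self):
        self.internal_score += 1
    def reset(self):
        self.internal_score = 0  # wipes state before the end-callback fires

env, episode = ToyEnv(), EpisodeCache()
for _ in range(3):
    env.step()
    on_episode_step(episode, env)
env.reset()                          # happens before on_episode_end, as reported
print(on_episode_end(episode, env))  # 3, not 0
```

The per-step snapshot costs a dict write per step but makes the end-of-episode callback independent of when the reset happens.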