-
Hi, this is a nice project for hybrid action spaces, and I see you mention PDQN/HPPO in `README.md`. Do you have any experimental results for these algorithms in this environment? If not, we want to…
-
Following up on a discussion I had with @nickp60 earlier on whether or not we should retune the `bbduk` parameters during trimming (given that we have some reads that look like adapter/empty sequence …
-
I want to make a Furuta pendulum, like [this](https://www.google.com/imgres?imgurl=https%3A%2F%2Fwww.researchgate.net%2Fpublication%2F227017529%2Ffigure%2Ffig1%2FAS%3A302327165669385%401449091821542%2…
-
When I run only _python3 run_tcp_rl.py --use_rl --result_, it stops after showing the build output.
But when I use the old method of simultaneously running _./waf --run "rl-tcp"_ and the command above, it succeeds in …
-
Thank you for creating this library; it really is amazing! However, when running train.py (algo=tdmpc, task=Hover) from /scripts I get the following error:
```
Traceback (most recent call last):
…
-
Support a continuous action space for selecting real-valued hyperparameters within the bounds specified by the algorithm space config:
- https://medium.com/@asteinbach/actor-critic-using-deep-rl-continuous-mounta…
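A common way to realize this is to let the policy emit a raw action in a fixed range (e.g. [-1, 1]) and rescale it into the configured bounds. The sketch below is illustrative only; the function names and the log-scale variant are assumptions, not part of any existing algorithm space config:

```python
import numpy as np

def action_to_hyperparam(action: float, low: float, high: float) -> float:
    """Map a raw continuous action in [-1, 1] linearly onto [low, high]."""
    action = float(np.clip(action, -1.0, 1.0))
    return low + (action + 1.0) * 0.5 * (high - low)

def action_to_log_hyperparam(action: float, low: float, high: float) -> float:
    """Same mapping, but on a log10 scale. Useful for hyperparameters
    like learning rates that span several orders of magnitude."""
    log_val = action_to_hyperparam(action, np.log10(low), np.log10(high))
    return 10.0 ** log_val

# Example: a policy output of 0.0 picks the midpoint of the range,
# while on a log scale it picks the geometric midpoint.
batch_size = action_to_hyperparam(0.0, 16.0, 512.0)       # midpoint of [16, 512]
learning_rate = action_to_log_hyperparam(0.0, 1e-5, 1e-2)  # geometric midpoint
```

On a log scale the action moves through orders of magnitude uniformly, which usually matches how sensitive training is to such hyperparameters.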
-
### What happened + What you expected to happen
Setting `use_kl_loss=False` in PPO with the new RL Module & Learner API fails due to an impossible-to-satisfy `assert` statement. Since line 500 in `ray.rl…
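For illustration, the fix presumably amounts to making the assertion conditional on the flag. This is a hypothetical sketch of that guard pattern, not the actual Ray RLlib code (the function name and message are invented):

```python
def validate_kl_config(use_kl_loss: bool, kl_coeff: float) -> None:
    """Only require a positive KL coefficient when the KL loss is enabled.

    With use_kl_loss=False, kl_coeff is unused, so an unconditional
    `assert kl_coeff > 0` would be impossible to satisfy legitimately.
    """
    if use_kl_loss:
        assert kl_coeff > 0.0, "kl_coeff must be > 0 when use_kl_loss=True"
    # When use_kl_loss is False, kl_coeff is ignored and no assert fires.
```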
-
# What has been done:
- ML course ✔️
- Documentation course - 2/3 ✔️
- JOSS review ⌚
- Paper:
  - Restructured paper ✔️
  - PR merged ✔️
  - Diagram of game theoretic model ✔️
  - Ch…
-
Hey,
I'm wondering if there is any intention to expand the code base for MuZero Unplugged to make it work in an offline RL setting?
-
### 🚀 Feature
Build the STAC algorithm as a callable algorithm: https://arxiv.org/pdf/2002.12928.pdf
### Motivation
Hyperparametrization is one of the most time- and cost-expensive things when training R…