-
Our improved MAL solutions currently take lots of samples, but implementing model-learning as RL with a complex NN describing the agent's "policy" might not be necessary, when essentially all we want to…
-
First, hands down, amazing work. Since this serves as a baseline, I see a possible improvement, if someone wants to implement it:
- The n-step return, as it stands, is biased (since you are using old off-policy sam…
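One standard fix is to down-weight stale transitions with truncated importance-sampling ratios, in the spirit of Retrace/V-trace. A minimal numpy sketch under that assumption (the array names and the single truncation constant are illustrative, not this repo's API):

```python
import numpy as np

def corrected_n_step_return(rewards, values, pi_probs, mu_probs,
                            gamma=0.99, rho_max=1.0):
    """n-step return with truncated importance-sampling correction.

    rewards[t]: rewards along the stored (off-policy) trajectory.
    values[t]:  bootstrapped state values, length n + 1.
    pi_probs[t] / mu_probs[t]: probability of the stored action under the
        current policy pi and the old behaviour policy mu.
    Stale transitions are down-weighted instead of trusted at full weight,
    removing the off-policy bias of the plain n-step return (at the cost
    of a slower-contracting, higher-variance target).
    """
    n = len(rewards)
    rhos = np.minimum(pi_probs / mu_probs, rho_max)  # truncated IS ratios
    g = values[n]                                    # bootstrap from the last state
    for t in reversed(range(n)):
        # V-trace-style backward recursion (with both truncation constants
        # equal): each temporal difference is scaled by its step's ratio.
        g = values[t] + rhos[t] * (rewards[t] + gamma * g - values[t])
    return g

# Toy check: on-policy (pi == mu) recovers the ordinary n-step return.
r = np.array([1.0, 1.0, 1.0])
v = np.array([0.0, 0.0, 0.0, 10.0])
p = np.ones(3)
print(corrected_n_step_return(r, v, p, p))  # 1 + 0.99 + 0.99^2 + 0.99^3 * 10
```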
-
Feedback:
Modelling:
- You don't need too many epochs (use Wandb to monitor your performance as training runs; rule of thumb: around 50 is enough)
- Validate after each epoch (see the sketch after this list)
- Consider using t11-t16 samples…
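A minimal sketch of that loop, validating after every epoch and logging to Wandb; `train_one_epoch`, `evaluate`, and the project name are hypothetical placeholders for the actual training and validation steps:

```python
"""Per-epoch validation with Weights & Biases logging.

train_one_epoch and evaluate are stand-ins for the real steps.
"""
import random
import wandb

def train_one_epoch(model):  # placeholder training step
    return random.random()   # pretend training loss

def evaluate(model):         # placeholder validation step
    return random.random()   # pretend validation loss

def fit(model=None, epochs=50):  # ~50 epochs, per the rule of thumb above
    run = wandb.init(project="model-learning")  # project name is made up
    for epoch in range(epochs):
        train_loss = train_one_epoch(model)
        val_loss = evaluate(model)              # validate after every epoch
        wandb.log({"epoch": epoch,
                   "train_loss": train_loss,
                   "val_loss": val_loss})       # curves appear live in Wandb
    run.finish()

if __name__ == "__main__":
    fit()
```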
-
Hi,
I was running a spatial dataset with 2,419 sampling units, 17 covariates (8 continuous covariates with 2nd order enabled, plus 1 intercept), 3 species, and 1 spatial random level. I found tha…
-
### 🚀 Feature
Could the recent BBF algorithm be added to the library? https://github.com/google-research/google-research/tree/master/bigger_better_faster
### Motivation
It is a model-free, single-agent R…
-
For reference, we will collect a list of discussed papers as well as the date of discussion in this issue.
-
Is there a way to save weights to a file and reload them later? For instance, in the car example there is ui.cpp, which lets the user control the car, while car.cpp appears to train it. I am assuming may…
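The car example is C++, but the pattern is language-agnostic: serialize each weight array to disk after training, then read it back before inference. A minimal Python/numpy illustration of that pattern (none of these names come from this repo):

```python
"""General save/reload pattern for network weights. This only illustrates
the idea; the repo's example code itself is C++."""
import numpy as np

def save_weights(path, weights):
    # weights: dict mapping layer name -> numpy array
    np.savez(path, **weights)

def load_weights(path):
    with np.load(path) as data:
        return {name: data[name] for name in data.files}

weights = {"w1": np.random.randn(4, 8), "b1": np.zeros(8)}
save_weights("car_weights.npz", weights)    # after training (cf. car.cpp)
restored = load_weights("car_weights.npz")  # before driving (cf. ui.cpp)
assert all(np.array_equal(weights[k], restored[k]) for k in weights)
```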
-
Link to this from the contribution guidelines; explain what is in scope, etc.
-
> The next step would be to train an agent with two optimization algorithms. For this, you could use the PPO and DQN algorithms from the reinforcement learning field. However, you could also…
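A minimal sketch of that suggestion, training the same task with both algorithms; stable-baselines3 and CartPole-v1 are assumptions on my part, since the quote names only PPO and DQN:

```python
"""Train one task with two RL algorithms, PPO and DQN, and compare.

stable-baselines3 and CartPole-v1 are assumed; the quoted advice only
names the two algorithms.
"""
import gymnasium as gym
from stable_baselines3 import DQN, PPO
from stable_baselines3.common.evaluation import evaluate_policy

for algo in (PPO, DQN):
    model = algo("MlpPolicy", "CartPole-v1", verbose=0)
    model.learn(total_timesteps=50_000)
    mean_reward, std_reward = evaluate_policy(
        model, gym.make("CartPole-v1"), n_eval_episodes=20)
    print(f"{algo.__name__}: {mean_reward:.1f} +/- {std_reward:.1f}")
```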
-
Policy Search
- [ ] [PI2](http://proceedings.mlr.press/v9/theodorou10a/theodorou10a.pdf) is already implemented, see #28 (a simplified sketch appears after this list)
- [ ] [PoWER](http://www.ias.informatik.tu-darmstadt.de/publications/peters_ADPR…
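For anyone picking up the PI2 item, a deliberately simplified, episodic form of the update (reward-weighted averaging of parameter perturbations); the full algorithm in the linked Theodorou et al. paper applies this per time step over DMP basis functions, and the cost function and constants below are illustrative:

```python
"""Simplified episodic PI2 update: perturb the policy parameters, roll out,
then average the perturbations with softmax weights over trajectory costs."""
import numpy as np

def pi2_step(theta, cost_fn, n_rollouts=32, sigma=0.1, lam=1.0, rng=None):
    rng = np.random.default_rng(rng)
    eps = sigma * rng.standard_normal((n_rollouts, theta.size))  # exploration noise
    costs = np.array([cost_fn(theta + e) for e in eps])          # rollout costs S_k
    # Softmax over negative costs: low-cost rollouts get exponentially more weight.
    z = -(costs - costs.min()) / lam
    w = np.exp(z) / np.exp(z).sum()
    return theta + w @ eps                                       # weighted update

# Toy usage: minimise a quadratic "trajectory cost".
theta = np.array([2.0, -1.5])
for _ in range(100):
    theta = pi2_step(theta, cost_fn=lambda th: float(np.sum(th ** 2)))
print(theta)  # should approach the optimum at the origin
```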