-
Is there a way to save weights to a file and reload them later? For instance, in the car example there is ui.cpp, which lets the user control the car, while car.cpp appears to train it. I am assuming may…
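The car example itself is C++, so in practice you would serialize its network's weight arrays there; the sketch below only illustrates the general save/reload pattern in Python. The file name and flat weight list are assumptions, not part of the example's actual code.

```python
import json

def save_weights(weights, path):
    """Serialize a list of float weights to a JSON file."""
    with open(path, "w") as f:
        json.dump(weights, f)

def load_weights(path):
    """Reload previously saved weights from the same file."""
    with open(path) as f:
        return json.load(f)

# Hypothetical weights produced by a training run (e.g. by car.cpp's loop).
trained = [0.5, -1.25, 3.0]
save_weights(trained, "car_weights.json")
restored = load_weights("car_weights.json")
assert restored == trained  # round-trips exactly for these values
```

Any format works as long as the load path rebuilds the network with the same layer shapes before copying the values back in.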
-
# How to recommend
We can recommend papers for further discussion under this issue. Include a link to the paper plus the conference name and other related information (like the abstract, some bas…
-
Link: [arxiv](https://arxiv.org/pdf/1907.02057.pdf)
Problem:
> Model-based reinforcement learning (MBRL) is widely seen as having the potential to be significantly more sample efficient than model-…
-
Two metrics for quantifying the performance of an RL training process:
(1) #hitting times: during training, we evaluate the currently trained policy every k (= 2000) time steps. A "go…
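The snippet is cut off before the full definition, but the periodic-evaluation loop it describes can be sketched as follows. Assumptions (not from the source): a "hit" means the evaluated score clears some goal threshold, and the stand-in policy/evaluation functions are placeholders for a real training run.

```python
import random

def evaluate_policy(policy, episodes=5):
    """Stand-in evaluation: average return of the current policy."""
    return sum(policy() for _ in range(episodes)) / episodes

def train_with_periodic_eval(total_steps=10_000, k=2000, goal=0.9):
    """Every k steps, evaluate the current policy; count evaluations
    that clear the goal threshold and record when the first occurred."""
    hits, first_hit = 0, None
    policy = lambda: random.random()  # placeholder for the trained policy's return
    for step in range(1, total_steps + 1):
        # ... one training update on the policy would happen here ...
        if step % k == 0:
            score = evaluate_policy(policy)
            if score >= goal:
                hits += 1
                if first_hit is None:
                    first_hit = step
    return hits, first_hit
```

The hit count summarizes how often training produced a good policy, while the step index of the first hit serves as a sample-efficiency measure.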
-
I've looked into the available documentation and examples, but haven't been able to figure out whether ML.NET in its current state can be used for (non-deep) reinforcement learning. If it is …
-
- With a small enough state and action space, we can use array and table representations to approximate the value functions.
- But with large, exponential state spaces, e.g. Tetris with `10^60`…
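The tabular case in the first bullet is exactly a 2-D array indexed by (state, action). As a minimal sketch, here is tabular Q-learning on a toy 5-state chain MDP (the environment, hyperparameters, and tie-breaking rule are all illustrative assumptions):

```python
import random

random.seed(0)

# Toy chain MDP: states 0..4, action 0 = left, action 1 = right;
# reaching state 4 ends the episode with reward 1.
N_STATES, N_ACTIONS = 5, 2

def step(s, a):
    s2 = min(s + 1, N_STATES - 1) if a == 1 else max(s - 1, 0)
    done = s2 == N_STATES - 1
    return s2, (1.0 if done else 0.0), done

# The entire value function is a plain 2-D table indexed by (state, action).
Q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]

alpha, gamma, eps = 0.5, 0.9, 0.1
for _ in range(500):
    s, done = 0, False
    while not done:
        if random.random() < eps:
            a = random.randrange(N_ACTIONS)          # explore
        else:
            a = max(range(N_ACTIONS), key=lambda a: (Q[s][a], a))  # greedy, ties prefer right
        s2, r, done = step(s, a)
        target = r + (0.0 if done else gamma * max(Q[s2]))
        Q[s][a] += alpha * (target - Q[s][a])        # tabular TD update
        s = s2
```

This works only because the table has 5 × 2 entries; for a state space like Tetris's the table is infeasible, which is what motivates function approximation.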
-
- [x] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [ ] RL algorithm bug
+ [ ] documentation request (i.e. "X is missing from the documentation.")
+ [x] ne…
-
- [ ] [LlamaGym/README.md at main · KhoomeiK/LlamaGym](https://github.com/KhoomeiK/LlamaGym/blob/main/README.md?plain=1)
DESCRIPTION:
Fine-tune LL…
-
I have defined specific ranges for each hyperparameter, and I want to find the best parameter for A2C or other algorithms. However, there can be numerous combinations, and how can I find the best para…
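One common answer to "too many combinations" is random search over the defined ranges instead of exhaustive grid search. The sketch below is a minimal illustration; the search space, the log-uniform learning-rate sampling, and especially the `evaluate` function are assumptions — in a real run `evaluate` would train A2C with the sampled config and return its mean episode reward.

```python
import math
import random

# Hypothetical ranges; adjust to the ranges you have defined.
SEARCH_SPACE = {
    "learning_rate": (1e-5, 1e-2),   # sampled log-uniformly
    "gamma": (0.9, 0.999),
    "n_steps": [5, 8, 16, 32],
}

def sample_config(space):
    lo, hi = space["learning_rate"]
    return {
        "learning_rate": 10 ** random.uniform(math.log10(lo), math.log10(hi)),
        "gamma": random.uniform(*space["gamma"]),
        "n_steps": random.choice(space["n_steps"]),
    }

def evaluate(cfg):
    # Stand-in objective; replace with: train A2C using cfg, return eval reward.
    return -abs(math.log10(cfg["learning_rate"]) + 3.0) - (1.0 - cfg["gamma"])

def random_search(n_trials=50):
    best_cfg, best_score = None, float("-inf")
    for _ in range(n_trials):
        cfg = sample_config(SEARCH_SPACE)
        score = evaluate(cfg)
        if score > best_score:
            best_cfg, best_score = cfg, score
    return best_cfg, best_score
```

For anything beyond a quick sweep, libraries such as Optuna provide smarter samplers (e.g. TPE) and early pruning of bad trials, which matters when each trial is a full RL training run.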
-
Hi there :wave:
I'm exploring using TinyMPC to generate MPC controllers from Julia via the JuliaControl ecosystem, and have set up a small example with 5 state variables, 1 input, and a predictio…