-
Hey, any idea how reinforcement learning could be applied here, based on a review of the decisions the network made?
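One hedged reading of this question is to treat reviewer verdicts on the network's logged decisions as scalar rewards and run reward-weighted behavior cloning offline. A minimal sketch, assuming a log of (state, action, review score) tuples; all data and names here are illustrative, not an existing API:

```python
import torch
import torch.nn as nn

# Hypothetical logged data: states, the actions the network took,
# and a reviewer score in [0, 1] for each decision (all illustrative).
states = torch.randn(256, 8)
actions = torch.randint(0, 4, (256,))
scores = torch.rand(256)

policy = nn.Sequential(nn.Linear(8, 64), nn.ReLU(), nn.Linear(64, 4))
opt = torch.optim.Adam(policy.parameters(), lr=1e-3)

for _ in range(100):
    logits = policy(states)
    # Reward-weighted behavior cloning: imitate each logged action
    # in proportion to how favorably it was reviewed.
    nll = nn.functional.cross_entropy(logits, actions, reduction="none")
    loss = (scores * nll).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
```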
-
Hello,
is it possible to return the differential of the step reward function (with respect to the action), at least for the simplest envs like Pendulum and CartPole?
Best, Jacek
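For Pendulum the step reward is analytic, so its derivative with respect to the action can be written down directly; for CartPole the action space is discrete, so that derivative is not defined. A minimal sketch, assuming Gymnasium's `Pendulum-v1` reward `-(theta^2 + 0.1*theta_dot^2 + 0.001*u^2)` (with `theta` angle-normalized); the helper below is illustrative, not part of any env API:

```python
def pendulum_reward_grad(theta: float, theta_dot: float, u: float) -> float:
    """Gradient of Pendulum-v1's step reward w.r.t. the torque action u.

    reward = -(theta**2 + 0.1 * theta_dot**2 + 0.001 * u**2),
    so d(reward)/du = -0.002 * u, independent of the state terms.
    """
    return -0.002 * u

# Example: torque u = 2.0 gives a reward gradient of -0.004.
print(pendulum_reward_grad(theta=0.3, theta_dot=-1.0, u=2.0))
```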
-
### What happened + What you expected to happen
In `/ray/rllib/examples/action_masking.py`, modify line 97: replace `ppo.PPOConfig()` with `dreamerv3.DreamerV3Config()`.
Bug:
Va…
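A minimal repro sketch of the described swap, assuming the Ray 2.x import path for DreamerV3 (the script and line number are as reported above; the env choice is illustrative, since the actual example uses a custom action-masking env):

```python
from ray.rllib.algorithms import dreamerv3

# Swap the example's ppo.PPOConfig() for DreamerV3Config(), as described above.
config = dreamerv3.DreamerV3Config().environment("CartPole-v1")
algo = config.build()  # the reported error presumably surfaces here or in train()
algo.train()
```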
-
[SHARK](https://github.com/nod-ai/SHARK) is a high-performance codegen compiler and runtime built on MLIR, IREE, and custom RL-based tuning infrastructure. [Here](https://nod.ai/shark-the-fastest-runti…
-
Project:
Simulation of RC/RL Circuit Response using Python or MATLAB
Problem Statement (Team 2): Develop a script that simulates the transient response of an RC (Resistor-Capacitor) or RL (Resistor-Ind…
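A minimal Python sketch of the RC branch of this project: the step response of a series RC circuit, where the capacitor voltage follows V(t) = Vs * (1 - exp(-t / (R*C))). The component values and source voltage below are illustrative assumptions:

```python
import numpy as np
import matplotlib.pyplot as plt

# Illustrative component values (assumptions, not from the problem statement).
R = 1e3      # resistance in ohms
C = 1e-6     # capacitance in farads
Vs = 5.0     # step source voltage in volts
tau = R * C  # time constant (seconds)

t = np.linspace(0, 5 * tau, 500)
v_c = Vs * (1 - np.exp(-t / tau))  # capacitor charging voltage

plt.plot(t * 1e3, v_c)
plt.xlabel("time (ms)")
plt.ylabel("capacitor voltage (V)")
plt.title("RC step response, tau = R*C")
plt.show()
```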
-
In running experiments on IMDB, I found very high variance in the validation and test set results, and I don't fully understand it, so I'm looking for some advice.
Here, I've run PPO f…
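One standard way to quantify such variance is to repeat the run across several seeds and report the mean and standard deviation of the final metric. A minimal sketch, where `train_and_eval` is a hypothetical stand-in for one full PPO run:

```python
import numpy as np

def train_and_eval(seed: int) -> float:
    """Hypothetical stand-in for one full PPO training run; returns a test score."""
    rng = np.random.default_rng(seed)
    return 0.85 + 0.05 * rng.standard_normal()  # simulated noisy outcome

scores = np.array([train_and_eval(s) for s in range(10)])
print(f"test score: {scores.mean():.3f} +/- {scores.std(ddof=1):.3f} over {len(scores)} seeds")
```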
-
Hello,
I am running the PPO algorithm from Ray RLlib. When I run the code, the screen looks like this:
![Screenshot from 2024-07-18 04-49-46](https://github.com/user-attachments/assets/7e11e7a6-d5d8-4e…
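For reference, a minimal RLlib PPO run whose console output should resemble that screen; a sketch assuming recent Ray 2.x APIs and an illustrative CartPole env:

```python
from ray.rllib.algorithms.ppo import PPOConfig

# Minimal PPO setup; the env here is illustrative.
config = PPOConfig().environment("CartPole-v1")
algo = config.build()

for i in range(3):
    result = algo.train()
    # train() returns a large result dict; the exact keys vary by Ray version.
    print(i, result.get("episode_reward_mean"))
```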
-
Very impressive work! I would like to ask whether it is possible to apply my RL algorithm to a network slicing problem. In network slicing, the action would be the allocation [a1, a2, a3, ..., ak] of re…
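A common way to encode such an allocation action is a continuous Box action normalized (e.g. via softmax) so the k entries sum to the available budget. A minimal Gymnasium sketch; the slice count, observation, and reward below are illustrative assumptions, not the poster's actual problem:

```python
import numpy as np
import gymnasium as gym
from gymnasium import spaces

class NetworkSlicingEnv(gym.Env):
    """Toy env: allocate a unit resource budget across k slices (illustrative)."""

    def __init__(self, k: int = 4):
        self.k = k
        self.action_space = spaces.Box(low=-5.0, high=5.0, shape=(k,), dtype=np.float32)
        self.observation_space = spaces.Box(low=0.0, high=1.0, shape=(k,), dtype=np.float32)

    def reset(self, *, seed=None, options=None):
        super().reset(seed=seed)
        self.demand = self.np_random.random(self.k).astype(np.float32)
        return self.demand, {}

    def step(self, action):
        # Softmax turns the raw action into allocations a1..ak summing to 1.
        z = np.exp(action - action.max())
        alloc = z / z.sum()
        # Illustrative reward: how much per-slice demand the allocation serves.
        reward = float(np.minimum(alloc, self.demand).sum())
        self.demand = self.np_random.random(self.k).astype(np.float32)
        return self.demand, reward, False, False, {}
```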
-
**LSML version:** (develop)
**Java version:** 8u301
**Steps to reproduce issue:**
1. Open LSML
2. Add SRM/MRM/LRM/ATM/RL to mechs
3. Look in WeaponLab
**Actual result:**
The damage falloff i…
-
## TL;DR
This RFC proposes separating sample generation and reward-model scoring from the original rollout process in PPO, enabling users to more flexibly customize sample generation and create sa…
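A hedged sketch of what such a separation could look like: independent generator and scorer interfaces that the PPO trainer consumes, instead of one fused rollout stage. All class and method names below are illustrative, not the RFC's actual API:

```python
from dataclasses import dataclass
from typing import List, Protocol

@dataclass
class Sample:
    prompt: str
    response: str
    reward: float = 0.0

class SampleGenerator(Protocol):
    """Produces samples; swappable independently of scoring (illustrative)."""
    def generate(self, prompts: List[str]) -> List[Sample]: ...

class RewardScorer(Protocol):
    """Scores samples; swappable independently of generation (illustrative)."""
    def score(self, samples: List[Sample]) -> List[Sample]: ...

def make_ppo_batch(prompts: List[str], gen: SampleGenerator, scorer: RewardScorer) -> List[Sample]:
    # Generation and scoring are now separate stages, so either can be
    # customized (e.g. offline samples, external reward models) without
    # touching the PPO update itself.
    return scorer.score(gen.generate(prompts))
```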