-
Hi everyone,
First of all thank you for maintaining this tool!
I am trying to save an RL model trained with stable-baselines3 via MLflow. Not all information from the model is needed, and stable ba…
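The core of the request above is saving only the needed parts of a model as an artifact. Below is a minimal stdlib-only sketch of that idea, using a plain dict as a stand-in for the trained model (the keys and structure are hypothetical, not the real stable-baselines3 layout); in a real run the pickled file would then be handed to `mlflow.log_artifact`.

```python
import os
import pickle
import tempfile

# Hypothetical stand-in for a trained model: in practice this would be a
# stable-baselines3 model object with a much richer structure.
model = {
    "policy_weights": [0.1, 0.2, 0.3],     # the part we actually need
    "replay_buffer": list(range(10_000)),  # large, not needed for inference
    "optimizer_state": {"step": 42},       # not needed for inference
}

def save_needed_parts(model, keys, path):
    """Pickle only the selected entries of the model dict."""
    subset = {k: model[k] for k in keys}
    with open(path, "wb") as f:
        pickle.dump(subset, f)
    return subset

tmpdir = tempfile.mkdtemp()
path = os.path.join(tmpdir, "policy_only.pkl")
saved = save_needed_parts(model, ["policy_weights"], path)

# In a real run, one would then log the file to the active MLflow run, e.g.:
# mlflow.log_artifact(path)
```

The design point is simply to serialize a filtered view of the model rather than the whole object, so the logged artifact stays small.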
-
### Describe your use-case.
There are multiple simple models used in this repository: BLIP, CLIP, and WD taggers. However, when it comes to detailed descriptions, they are all dwarfed by modern multi…
-
https://arxiv.org/abs/1705.10843
"In unsupervised data generation tasks, besides the generation of a sample based on previous observations, one would often like to give hints to the model in order …
mrwns updated 7 years ago
-
### Issue Severity
Minor: a workaround is available (torch must be installed additionally).
### What happened + What you expected to happen
PPO Trainer instantiation via the RLModule API fails if I wan…
-
## Describe your environment
* Operating system: macOS Sonoma 14.4.1 (23E224)
* Python Version: Python 3.12.4 (`python -V`)
* CCXT version: ccxt==4.3.79 (`pip freeze | grep ccxt`)
* Freqtr…
-
I would like to ask for your advice on the following two questions.
1. DPO training does not seem to support DeepSpeed ZeRO. After manually integrating `DPOAlignerArguments` with the `FinetunerArguments…
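One common way to "manually integrate" two argument classes like the ones mentioned above is to combine them via dataclass inheritance, so a single parser sees every field. A minimal sketch, with hypothetical stand-ins for `DPOAlignerArguments` and `FinetunerArguments` (the real classes live in the library and carry many more fields; the config filename is made up):

```python
from dataclasses import dataclass, fields
from typing import Optional

# Hypothetical stand-in for DPOAlignerArguments.
@dataclass
class DPOAlignerArguments:
    beta: float = 0.1
    max_prompt_length: int = 512

# Hypothetical stand-in for FinetunerArguments.
@dataclass
class FinetunerArguments:
    learning_rate: float = 5e-5
    deepspeed: Optional[str] = None  # path to a DeepSpeed ZeRO config JSON

# A combined dataclass that inherits from both: every field from the two
# parents becomes a field of the combined class, in reverse-MRO order.
@dataclass
class CombinedArguments(FinetunerArguments, DPOAlignerArguments):
    pass

args = CombinedArguments(deepspeed="ds_zero3.json")
```

This pattern works with any parser that introspects dataclass fields; whether the resulting `deepspeed` path is actually honored by the training loop is a separate question, which is what the issue is asking about.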
-
Hello, and thank you!
I tested your library today with a KUKA FRI connection. It works, of course, but using the given model file from rl-examples I can't get the correct transformation of the end effector …
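End-effector transformation questions like this usually come down to composing homogeneous transforms in the right order. As a point of reference, here is a stdlib-only sketch of that composition for a toy two-joint planar chain (the link lengths and joint angles are made up for illustration; nothing here is KUKA-specific):

```python
import math

def mat_mul(a, b):
    """Multiply two 4x4 matrices given as lists of lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def rot_z(theta):
    """Homogeneous rotation about the z-axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0, 0],
            [s,  c, 0, 0],
            [0,  0, 1, 0],
            [0,  0, 0, 1]]

def trans(x, y, z):
    """Homogeneous translation."""
    return [[1, 0, 0, x],
            [0, 1, 0, y],
            [0, 0, 1, z],
            [0, 0, 0, 1]]

# Toy chain: rotate by q1, move L1 along local x, rotate by q2, move L2.
q1, q2 = math.pi / 2, 0.0
L1, L2 = 0.4, 0.3
T = mat_mul(mat_mul(mat_mul(rot_z(q1), trans(L1, 0, 0)),
                    rot_z(q2)), trans(L2, 0, 0))
ee_pos = (T[0][3], T[1][3], T[2][3])  # end-effector position
```

With `q1 = pi/2` and `q2 = 0`, the chain points straight along +y, so the position comes out near `(0, 0.7, 0)`. A mismatch against a vendor model is often just a different convention for the order of these factors or for the base/tool frames.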
-
### Motivation.
In online RL training, vLLM can significantly accelerate the rollout stage. To achieve this, we need to sync weights from the main training process to the vLLM worker process and then call the e…
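The train-then-sync-then-rollout loop described above can be sketched in-process with plain Python. All names below are hypothetical stand-ins (real code would move tensors via `torch.distributed` or vLLM's own weight-loading hooks, not a deep copy):

```python
import copy

class Trainer:
    """Hypothetical stand-in for the main training process."""
    def __init__(self):
        self.weights = {"layer.w": [0.0, 0.0]}

    def train_step(self):
        # Pretend a gradient step updated the weights.
        self.weights = {"layer.w": [0.5, -0.5]}

class RolloutWorker:
    """Hypothetical stand-in for the vLLM worker process."""
    def __init__(self):
        self.weights = None

    def load_weights(self, state_dict):
        # Real code would receive tensors over IPC/NCCL; here we
        # just take an independent copy of the trainer's state dict.
        self.weights = copy.deepcopy(state_dict)

    def generate(self, prompt):
        # Stand-in for the rollout/generation call.
        return f"rollout({prompt}) with {self.weights['layer.w']}"

trainer, worker = Trainer(), RolloutWorker()
trainer.train_step()
worker.load_weights(trainer.weights)  # sync before each rollout phase
out = worker.generate("hello")
```

The point of the sketch is the ordering constraint: the worker must receive the freshest weights before each rollout phase, and it should hold its own copy so later training steps do not mutate the weights mid-generation.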
-
### What happened + What you expected to happen
File: `/ray/rllib/examples/action_masking.py`
Modification:
Replace `ppo.PPOConfig()` on line 97 of `action_masking.py` with `dreamerv3.DreamerV3Config()`.
Bug:
Va…
-
Hey,
I'm wondering if there is any intention to expand the code base for MuZero Unplugged to make it work in an offline RL setting?