model-free-rl Search Results

1000+ results
for model-free-rl

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Wunder2dream/RL #1

Model free RL

- **Model-Free vs Model-Based RL** >1)**Model-based** algorithmis an algorithm that uses _the transition function_ (and _the reward function_) in order to estimate the optimal policy. > The age…

Wunder2dream updated 3 years ago
7
Not-Nik/raylib-zig #99

Can someone check if shaders_mesh_instancing example works f…

Can someone check [shaders_mesh_instancing](https://www.raylib.com/examples/shaders/loader.html?name=shaders_mesh_instancing) example works from zig? I have the following code based on the example abo…

nitanmarcel updated 2 weeks ago
3
RobLoach/raylib-cpp #299

double free from Shaders

When loading the Models with Mesh the ownership is not clear and the mash(es)(?) got double freed. Logs ```bash INFO: TEXTURE: [ID 2] Texture loaded successfully (128x128 | GRAY_ALPHA | 1 mipmaps…

furudbat updated 9 hours ago
7
OptimalScale/LMFlow #870

DPO+ZeRO train error

I would like to ask for your advice on the following two questions. 1. DPO train does not seem to support DeepSpeed ZeRO. After manually integrating `DPOAlignerArguments` with the `FinetunerArguments…

tankeui updated 5 days ago
2
MolecularAI/REINVENT4 #107

CUDA out of memory issue when running RL with mol2mol_simila…

I'm encountering a CUDA out of memory error while running reinforcement learning (RL) using the mol2mol_similarity model on a SageMaker **ml.g5.xlarge** instance (**24GB GPU memory**). The RL process …

patricia-rocha updated 7 hours ago
10
Geonhee-LEE/mobile_robot_control #10

Package architecture

Package architecture: - controllers: > class control: PID, pure pursuit, bang-bang, open-loop(velocity profile), ... > optimal control: lqr, ddp, mpc, ... > collision avoidance: RVO, O…

Geonhee-LEE updated 2 years ago
1
projectmesa/mesa #2142

huggingface hub integration

I have seen that https://gradio.app/ is used in the UIs for Hugging Face. @wang-boyu have you looked into it, since it is listed in one of the possible frameworks to use in the GSoC wiki? See also ht…

adamamer20 updated 1 month ago
4
histmeisah/Large-Language-Models-play-StarCraftII #11

agent keeps losing the games

I was trying to reproduce the experiment and test the various LLMs. I tried both Chatgpt-4 and chatgpt-3.5, and set the game difficulty to 2. I run the game for 3 to 4 times and LLM lost all of them. …

detectiveron updated 2 weeks ago
7
yandexdataschool/Practical_RL #84

Broken links

I used the following two commands to identify broken links. `markdown-link-check` is https://github.com/tcort/markdown-link-check ``` bash find ./Practical_RL/ -type f -name '*.ipynb' -exec jupyt…

yagudin updated 4 years ago
9
facebookresearch/darkforestGo #37

How much memory do I need to train

Hi, First of all, thanks to your nice work. I was trying to run your go engine on my server, which has about 120 GiB memory. It went all right until I tried to train with provided dataset. …

HongweiQin updated 6 years ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for model-free-rl

1000+ results
for model-free-rl