reinforcement-learning Search Results

microsoft/autogen #4282

Dynamic Agent Composition with Reinforcement Learning

### What feature would you like to be added? Implement a system for dynamically composing and optimizing agent workflows using reinforcement learning (RL) techniques (this feature can be integrated…

peymanrahi updated 19 hours ago

Niketkumardheeryan/ML-CaPsule #1142

Waste Management through Reinforcement Learning

The project aims to develop a reinforcement learning (RL) agent to optimize waste collection in a simulated environment, minimizing overflow events and improving efficiency. Environment and State R…

Panchadip-128 updated 4 weeks ago

GarimaSingh0109/WasteManagment #359

[Feature] Waste Management through Reinforcement Learning te…

### Description The project aims to develop a reinforcement learning (RL) agent to optimize waste collection in a simulated environment, minimizing overflow events and improving efficiency. Environm…

Panchadip-128 updated 2 weeks ago

HKU-Smart-Mobility-Lab/Transportation_Simulator #40

Learning to delay in ride-sourcing systems: a multi-agent de…

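I've recently been following the approach in this paper. Orders that come from different platforms should probably be handled somewhat differently, which adds many dimensions to think about. I'm wondering whether the code for the paper "Learning to delay in ride-sourcing systems: a multi-agent deep reinforcement learning framework" can be made public. I hope we get a chance to discuss this. Thanks!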

GuangwenSi updated 3 days ago

PySymGym/PySymGym #135

Cleaning up artifacts from the reinforcement learning

- [ ] Clean up client side - [ ] Clean up server side - [ ] usvm - [ ] V#

Parzival-05 updated 4 weeks ago

YangletLiu/CSCI4961_labs_projects #60

More quantum-related modules

Quantum-inspired - Reinforcement learning for quantum circuit design (Allan, Zhiyuan, Thomas) - Reinforcement learning for drug discovery (Dannong) - Approximating wave functions using transformer-…

Chriun updated 1 day ago

dotnet/machinelearning #181

I've looked into the available documentation and examples, but haven't been able to figure out if it is possible to use ML.NET in its current state for (non-deep) reinforcement learning. If it is …

jarnmo updated 1 month ago

schwallergroup/saturn #4

Error with Replay Buffer

Hi Jeff, When trying to run Saturn I get the following error message for the replay buffer. ``` Traceback (most recent call last): File "/app/saturn/saturn.py", line 76, in reinforceme…

JanoschMenke updated 3 hours ago

isaac-sim/IsaacLab #1445

[Question] QuadCopter training with camera

Hi, Apologies if this question has been asked before or if the answer is obvious. I’m still relatively new to reinforcement learning, robotics, and simulation, having started just a month ago. I…

JulienHansen updated 6 hours ago

huggingface/trl #2326

Support for MiniCPM-V Reinforcement Learning with Direct Pre…

### Feature request Hi! I’d like to request support for reinforcement learning with DPO for the MiniCPM-V model. I'm not sure if the current state of this repository allows this vision model to …

DarioPTWR updated 1 week ago

1000+ results for reinforcement-learning
