-
CI test **linux://rllib:learning_tests_multi_agent_cartpole_ppo_multi_gpu** is flaky. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/5091#019048f0-d3fa-4e73-a14c-666a03aa0ea8
…
-
https://arxiv.org/pdf/2303.11366.pdf
![Screenshot 2024-04-04 at 12 16 20 PM](https://github.com/Aidenzich/road-to-master/assets/57204353/ab4db2ed-d47f-4729-8ada-d3458f709af9)
![IMG_0969](https://g…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Feature Description
The project aims to develop a reinforcement learning (RL) agent to optimize waste collecti…
-
### Self Checks
- [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones.
- [X] I confirm that I am using English to su…
-
The following parameters will be considered for the model:
- learning rate
- exploration rate
- discount rate
make sure to explain in the comments what every parameters means and how they affe…
-
We want to add support for an authenticated communication between agents, so the bots can share knowledge and converge to the optimal QTable more quickly.
IPv8 will be used for communication and pr…
-
**As an** agent
**I want to** be able to use Q-Learning to use as a strategy
**so that** I can play snake
## Acceptance Criteria
### AC1
Given I am starting to play a game of snake
When I do n…
-
# (Quant Trading Question!) Using RD-Agent to find and optimize profitable strategies?
I wonder if you have any examples, or if no examples yet, can clarify if you think this api is READY for this…
-
Is the process stopping because I requested only 2 ideas to be generated?
I'm also curious about how to obtain the full paper.
I've been waiting for an hour, and the GPT API usage has been stu…
-
Very fast agent learning and training, and i understand that it maybe for testing reasons but when realease will come its will be an issue.
Learning speed should be reduced in 4 times)
(notice it at…