learning-agent Search Results

1000+ results
for learning-agent

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ray-project/ray #46226

CI test linux://rllib:learning_tests_multi_agent_cartpole_pp…

CI test **linux://rllib:learning_tests_multi_agent_cartpole_ppo_multi_gpu** is flaky. Recent failures: - https://buildkite.com/ray-project/postmerge/builds/5091#019048f0-d3fa-4e73-a14c-666a03aa0ea8 …

can-anyscale updated 28 minutes ago
33
Aidenzich/road-to-master #48

Reflexion: Language Agents with Verbal Reinforcement Learnin…

https://arxiv.org/pdf/2303.11366.pdf ![Screenshot 2024-04-04 at 12 16 20 PM](https://github.com/Aidenzich/road-to-master/assets/57204353/ab4db2ed-d47f-4729-8ada-d3458f709af9) ![IMG_0969](https://g…

Aidenzich updated 6 months ago
3
recodehive/machine-learning-repos #1533

💡[Feature]: Effective Waste Management using Reinforcement L…

### Is there an existing issue for this? - [X] I have searched the existing issues ### Feature Description The project aims to develop a reinforcement learning (RL) agent to optimize waste collecti…

Panchadip-128 updated 6 hours ago
1
langgenius/dify #8707

Suggestion to Add Custom Multi-turn Dialogues for Few-shot L…

### Self Checks - [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones. - [X] I confirm that I am using English to su…

neotize updated 4 weeks ago
2
a-maa/RL15 #3

(Renamed) Agent heading to sub-goal before reaching end

The following parameters will be considered for the model: - learning rate - exploration rate - discount rate make sure to explain in the comments what every parameters means and how they affe…

MinimalistSwan updated 1 week ago
3
Tribler/Dollynator #50

Cooperative multi-agent reinforcement learning

We want to add support for an authenticated communication between agents, so the bots can share knowledge and converge to the optimal QTable more quickly. IPv8 will be used for communication and pr…

MattSkala updated 5 years ago
1
DiscoverAI/pungi #2

create agent with q learning

**As an** agent **I want to** be able to use Q-Learning to use as a strategy **so that** I can play snake ## Acceptance Criteria ### AC1 Given I am starting to play a game of snake When I do n…

meandor updated 5 years ago
1
microsoft/RD-Agent #381

(Quant Trading Question!) Using RD-Agent to find and optimiz…

# (Quant Trading Question!) Using RD-Agent to find and optimize profitable strategies? I wonder if you have any examples, or if no examples yet, can clarify if you think this api is READY for this…

keithorange updated 3 weeks ago
7
SakanaAI/AI-Scientist #71

process stops progressing after reaching "generating idea 2…

Is the process stopping because I requested only 2 ideas to be generated? I'm also curious about how to obtain the full paper. I've been waiting for an hour, and the GPT API usage has been stu…

clean-e2map updated 1 month ago
3
OpenApoc/OpenApoc #613

Very fast agent learning and training

Very fast agent learning and training, and i understand that it maybe for testing reasons but when realease will come its will be an issue. Learning speed should be reduced in 4 times) (notice it at…

makus82 updated 1 year ago
7

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for learning-agent

1000+ results
for learning-agent