learning-agents Search Results

1000+ results
for learning-agents

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Cattharine/product_owner_rl #63

Add new RL algorithm to compare with baseline (SAC Discrete)

Our current baseline RL algorithm is DQN (more accurately it is DDQN). Named algorithm uses epsilon-greedy policies to at least have a chance of fully investigating environment in question. Using epsi…

Cattharine updated 4 weeks ago
2
SakanaAI/AI-Scientist #71

process stops progressing after reaching "generating idea 2…

Is the process stopping because I requested only 2 ideas to be generated? I'm also curious about how to obtain the full paper. I've been waiting for an hour, and the GPT API usage has been stu…

clean-e2map updated 3 months ago
3
crewAIInc/crewAI #1252

[BUG] Pydantic error with CrewAi + langchain_ollama

### Description I defined my llms as following: ` from crewai import Agent, Crew, Process, Task from crewai.project import CrewBase, agent, crew, task from langchain_ollama import ChatOllama …

widarr updated 4 days ago
8
microsoft/FLAML #1064

Support In-Context-Learning (ICL) in agents

Scenario: Interactive example selection. More specifically, the`AssistantAgent` can ask for examples anytime during the interaction with the `UserProxyAgent` ```[tasklist] ### Tasks - [x] Review …

qingyun-wu updated 1 year ago
4
irthomasthomas/undecidability #731

LlamaGym: Online Reinforcement Learning for LLM-based agents…

- [ ] [LlamaGym/README.md at main · KhoomeiK/LlamaGym](https://github.com/KhoomeiK/LlamaGym/blob/main/README.md?plain=1) # LlamaGym/README.md at main · KhoomeiK/LlamaGym DESCRIPTION: Fine-tune LL…

irthomasthomas updated 8 months ago
1
Coding-Connoisseur/AI-Team #10

Agent Collaboration and Feedback Loop

**User Story**: Agent Collaboration and Feedback Loop **Tasks**: - Enable agents to collaborate on complex tasks (Due: 2024-12-12)

Coding-Connoisseur updated 1 month ago
1
Unity-Technologies/ml-agents #6171

ValueError: 'Elo' is not a valid CompletionCriteriaSettings.…

**Describe the bug** [Elo is supposed to be a valid measure for curriculum learning](https://github.com/Unity-Technologies/ml-agents/blob/200fe54e14b649d6eac66a7f0779c1086c506919/docs/Training-ML-Age…

jporubci updated 1 day ago
2
TMats/survey #57

Imagination-Augmented Agents for Deep Reinforcement Learning

https://arxiv.org/abs/1707.06203

TMats updated 7 years ago
2
google/brax #537

Best Practice for Passing/Storing Training Progress for Curr…

Hi Brax team, I’m working on a reinforcement learning project using Brax to train a PPO agent and I’m trying to implement curriculum learning by adjusting the environment's difficulty dynamically bas…

hukz18 updated 1 day ago
2
hpi-sam/Robust-Multi-Agent-Reinforcement-Learning-for-SAS #16

summarize discussion of transfer learning between different …

jocodeone updated 2 years ago
2

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for learning-agents

1000+ results
for learning-agents