learning-agent Search Results

1000+ results
for learning-agent

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

crewAIInc/crewAI #1252

[BUG] Pydantic error with CrewAi + langchain_ollama

### Description I defined my llms as following: ` from crewai import Agent, Crew, Process, Task from crewai.project import CrewBase, agent, crew, task from langchain_ollama import ChatOllama …

widarr updated 2 days ago
6
BradenEverson/earthmover #1

HiveMind: First Steps

The HiveMind interface will be a web server that accepts connections via websocket. The agent will connect to the HiveMind and stream it's binary and serialized session to it accordingly. For these fi…

BradenEverson updated 1 week ago
4
LisandraMoura/Mario-kart-RL #2

Ambiente de simulação Mario Kart

#### Testes iniciais - [x] Escolher um repositório de Mario Kart 64 para base de comparação - [x] Testar e ver o funcionamento do repositório - [x] Estudar o repositório (coleta de parâmetros usados)…

LisandraMoura updated 3 days ago
3
adithya-s-k/World-of-AI #51

Stock Market Trading Agent Using Deep Reinforcement Learning

## Project Request The project aims to develop a Stock Market Trading Agent using Deep Reinforcement Learning. --- | Field | Description | | ------ | -----------------…

ayush-09 updated 1 year ago
2
riebl/artery #346

Creating an executable instead of using build

Hello, I am using Reinforcement Learning with Artery and wanted to integrate veins-gym. Based on the example provided [here](https://github.com/ComNetsHH/omnetpp-ml/blob/main/docs/openai_gym.md), I…

Rom-1T updated 3 weeks ago
1
TabbyML/tabby #3263

VSCode code inline completion with wrong indentation size

**Describe the bug** Sometimes, the code completion hint is not formatted according to the context above or below. ![image](https://github.com/user-attachments/assets/7a5e30e6-1432-4511-be62-ba0df…

Sma1lboy updated 4 days ago
2
girlscript/winter-of-contributing #6366

Data Science with Python: Multi-agent reinforcement learning

### Description Welcome to the 'DSWP' Team, good to see you here. With this issue, readers will get introduced to the core information about 'Multi-agent reinforcement learning'. To get assigne…

Pushpit07 updated 2 years ago
5
intelligent-machine-learning/dlrover #1290

add xpu monitor for dlrover

# Background Dlrover is an elastic deep learning framework, with fault-tolerance of processes failure, POD losting etc. Since the LLM training is at large scale and always span for a long time, many …

majieyue updated 5 days ago
1
act3-ace/safe-autonomy-sims #26

initial inspected points not counted towards reward

on env reset self.prev_weight_inspected is set to 0.0 but on each step, it is set to self.chief.inspection_points.get_total_weight_inspected() before taking any action. So any points seen on initializ…

JohnMcCarroll updated 6 hours ago
1
ldoshi/rome-wasnt-built-in-a-day #213

Investigate epsilon and sweep hyperparameters for DQN

Trying to debug larger width environments (7 currently). Things to try: 1. Different metric (Average Q-value from 2015 paper https://arxiv.org/pdf/1312.5602.pdf). ``` 5.1 Training and Sta…

josephmaa updated 5 days ago
57

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for learning-agent

1000+ results
for learning-agent