-
My fine-tuning command is based on the example provided in this repository:
https://github.com/FlagOpen/FlagEmbedding/tree/master/examples/unified_finetune
Fine-tuning command:
```
export CUDA_VISIBLE_DEVICES=0,1
torchrun --nproc_per_node 2 \
-m FlagEmbedding.BGE_M3…
```
-
### Describe the issue
> [!TIP]
> ## Want to get involved?
> We'd love it if you did! Please get in contact with the people assigned to this issue, or leave a comment. See general contributing ad…
-
A lot of research in the field of RL is being done nowadays.
I thought it could be both interesting and productive to have a post that brings in new research from time to time that might be relevant …
-
Hi, I was trying to test the code I wrote for the agents in the reinforcement learning implementation. To do so, I first ran the file test.py, selecting one of the agents with …
-
It seems the code in chapter 4 just shows some search functions,
so I don't have to update the code I wrote in chapters 1, 2, and 3?
Thanks in advance to whoever answers, love you!
-
Hi,
I stumbled upon the following potential improvement. I am hacking on it right now, but it would be great to have a proper solution. MCTS and other forward-simulation techniques must have access to…
-
Thank you for your great work.
1. I want to use the dataset to train an agent, but the datasets you provided are too large. What is the minimum dataset needed to get a satisfying result?
2. th…
-
Hi, I’ve been using PPO to train an agent, and I noticed that the agent’s performance fluctuates even when it seems to have found an optimal policy. Specifically, after 200 episodes, the success rate …
-
### 🚀 The feature, motivation and pitch
Would be cool if PyTorch had something like an agent that we can spin up in the cloud, or even have a multi-user service, so instead of specifying cpu or gpu, we c…
-
We've received feedback from an [advanced user](https://github.com/erik-megarad) (working on [AutoPack](https://github.com/AutoPackAI/autopack) and [Beebot](https://github.com/AutoPackAI/beebot)) that…