learning-agent Search Results

1000+ results for learning-agent

silky/ideas #121

interactive wikipedia project

build an interactive learning layer on top of wikipedia examples 1. go to pages on economics, run simulations of various rational agent configurations and auctions 2. go to pages on ML, look at inter…

silky updated 8 years ago
1
FragileTech/FractalAI #29

[Suggestion- Extending Results] FAI as alternative uses-case…

- One of the big results in ["Learning to Plan Chemical Syntheses"](https://arxiv.org/abs/1708.04202) or ["Towards "AlphaChem": Chemical Synthesis Planning with Tree Search and Deep Neural Network Pol…

0bserver07 updated 6 years ago
3
meta-introspector/meta-meme #92

Emacspeak

Imagine an AGI that interacts with Emacs through Emacspeak, essentially pretending to be blind to provide a more human-like and accessible experience. This approach has the potential to enhance the in…

jmikedupont2 updated 1 year ago
2
gtri/scrimmage #332

Unable to convert function return value to a Python type

Answer the following questions: * what are you trying to do? * what is the problem and how can it be recreated? * what scrimmage commit are you on? You can see this with `git rev-parse HEAD` …

TianrongChen updated 5 years ago
1
hill-a/stable-baselines #645

Can an agent learn valid actions offline, being able to choo…

Hi, Can anyone give me advice on training an RL agent, that can choose actions only from a given data set. I am working on a control system problem. I have collected half a year worth of data ab…

VieVaWaldi updated 4 years ago
5
stardist/stardist #257

get the dist, points, scores from the def non_maximum_suppre…

Hi Stardist team, I am wondering how to access the dist, point and scores per object that is given by the non_maximum_suppression_inds function per object ? and how does it works ? The idea behi…

Nal44 updated 9 months ago
7
waffoo/accel #19

High Policy Loss in SAC_CQL

`policy_loss` in SAC_CQL is significantly higher than the official implementation when tested with `hopper-expert-v0` in d4rl. https://github.com/waffoo/accel/blob/af3f511ea816b2dd80346fe5a0b5e2b395c…

waffoo updated 3 years ago
1
philsupertramp/chess #3

Develop deep reinforcement model [Epic]

- [ ] #4 - [ ] #12 - [x] #5 - [ ] #6 - [ ] #7

philsupertramp updated 3 years ago
8
Farama-Foundation/Gymnasium #28

[Proposal] Tutorials

### Proposal To encourage the use of Gymnasium and build up the RL community, I would propose that a large range of tutorials are created. This is a list of tutorials that could be made - [x…

pseudo-rnd-thoughts updated 3 weeks ago
18
oxwhirl/pymarl #115

Inconsistent between code and pseudocode in agent input

Reading the pseudocode in paper [Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning](https://arxiv.org/abs/2003.08839) ![image](https://user-images.githubusercontent…

Ynjxsjmh updated 3 years ago
2
