-
build an interactive learning layer on top of Wikipedia
examples
1. go to pages on economics, run simulations of various rational agent configurations and auctions
2. go to pages on ML, look at inter…
silky updated
8 years ago
-
- One of the big results in ["Learning to Plan Chemical Syntheses"](https://arxiv.org/abs/1708.04202) or ["Towards "AlphaChem": Chemical Synthesis Planning with Tree Search and Deep Neural Network Pol…
-
Imagine an AGI that interacts with Emacs through Emacspeak, essentially pretending to be blind to provide a more human-like and accessible experience. This approach has the potential to enhance the in…
-
Answer the following questions:
* what are you trying to do?
* what is the problem and how can it be recreated?
* what scrimmage commit are you on? You can see this with `git rev-parse HEAD`
…
-
Hi,
Can anyone give me advice on training an RL agent that can choose actions only from a given data set?
I am working on a control system problem. I have collected half a year's worth of data ab…
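
One way to frame the question above is batch-constrained (offline) Q-learning, where the agent learns from a fixed log and may only pick actions that were recorded for a given state. The following is a minimal tabular sketch under stated assumptions, not a recommendation for this specific control problem: the state and action names, the `(state, action, reward, next_state, done)` tuple layout, and the helper names `batch_constrained_q` and `greedy_action` are all hypothetical and chosen for illustration.

```python
from collections import defaultdict


def batch_constrained_q(transitions, gamma=0.9, sweeps=200, lr=0.5):
    """Tabular Q-learning over a fixed log of
    (state, action, reward, next_state, done) tuples."""
    # Actions actually observed in each state; backups may only use these.
    seen = defaultdict(set)
    for s, a, _, _, _ in transitions:
        seen[s].add(a)

    q = defaultdict(float)
    for _ in range(sweeps):
        for s, a, r, s2, done in transitions:
            # Bootstrap only from (state, action) pairs present in the log.
            if done or not seen[s2]:
                nxt = 0.0
            else:
                nxt = max(q[(s2, b)] for b in seen[s2])
            q[(s, a)] += lr * (r + gamma * nxt - q[(s, a)])
    return q, seen


def greedy_action(q, seen, state):
    """Best logged action for a state, or None if the state never appeared."""
    if not seen.get(state):
        return None
    return max(seen[state], key=lambda a: q[(state, a)])


# Hypothetical toy log: two states, two actions, made-up rewards.
log = [
    ("low", "heat", 1.0, "ok", True),
    ("low", "idle", 0.0, "low", False),
    ("ok", "idle", 1.0, "ok", True),
]
q_table, logged_actions = batch_constrained_q(log)
choice = greedy_action(q_table, logged_actions, "low")
```

Because the maximum in the update ranges only over logged actions, the learned policy never selects an action the data set does not contain for that state. With continuous states or actions, the same constraint would need a function approximator and a different mechanism, which this sketch does not cover.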
-
Hi Stardist team,
I am wondering how to access the dist, point, and scores for each object returned by the `non_maximum_suppression_inds` function, and how that function works.
The idea behi…
Nal44 updated
9 months ago
-
`policy_loss` in SAC_CQL is significantly higher than the official implementation when tested with `hopper-expert-v0` in d4rl.
https://github.com/waffoo/accel/blob/af3f511ea816b2dd80346fe5a0b5e2b395c…
-
- [ ] #4
- [ ] #12
- [x] #5
- [ ] #6
- [ ] #7
-
### Proposal
To encourage the use of Gymnasium and build up the RL community, I propose creating a wide range of tutorials.
This is a list of tutorials that could be made:
- [x…
-
Reading the pseudocode in the paper [Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning](https://arxiv.org/abs/2003.08839)
![image](https://user-images.githubusercontent…