-
-
On page 204+ (chapter 8) of Sutton (Reinforcement Learning, 2nd edition) Sutton presents tile coding. I do not believe tile algorithms are already included in this repo. I think the library would bene…
-
To implement the reinforcement learning algorithms like A3C, directly setting the gradient of `Parameters` and `LookupParameters` will be necessary. e.g. `Parameters.set_grad(self, array)`
Further,…
-
In terms of functionality, the mid-term end goal is to achieve an offering of ML algorithms and pre-processing routines comparable to what is currently available in Python's [`scikit-learn`](https://s…
-
-
https://arxiv.org/abs/1804.04603
```
Image segmentation needs both local boundary position information and global object context information. The performance of the recent state-of-the-art metho…
-
Dear Feiyun,
I've been reading your paper,
[Cohesion-based Online Actor-Critic Reinforcement Learning for mHealth Intervention](https://arxiv.org/pdf/1703.10039.pdf),
with much interest. I wo…
-
I have referred to some people's work on adding RNNs to reinforcement learning algorithms, but strangely, almost everyone's code implementation is different. So I would like to ask how you integrate L…
-
> Der nächste Schritt wäre einen Agenten mit zwei Optimierungsalgorithmen zu trainieren. Hierfür könnten Sie im Reinforcement Learning-Bereich den PPO und DQN Algorithmus verwenden. Sie könnten aber a…
-
There's a culture in ML of authors making their textbooks available online (to supplement the traditional print editions), which is extremely beneficial to students & researchers. The following is a l…
ghost updated
5 years ago