-
Hello. Thank you for your amazing work. I appreciate the efforts to provide a unified library of MARL algorithms and environments for benchmarking and reproducibility. To better achieve this goal, I s…
-
Thank you for the work. I recently started working on reinforcement learning for mathematical research (with the formal language and deduction system of a proof assistant as the environment); it's not st…
-
It would be interesting to port a few basic communication environments/training procedures to Flow. In particular, a popular communication baseline is "Learning Multi-agent Communication with Backpr…
-
If you want to become a reviewer for ReScience, please post your information here. The format is:
```
[name](github account link)
Scientific expertise - Language expertise
ORCID: [xxxx](http…
```
-
The labels here in keras-gcn do not seem to correspond to the labels of the gcn repository when you load the data. The indices are the same, but the values are not.
Also, if you sum y_train here, …
-
# Reinforcement learning for 3D Volleyball game
## Team members:
- Tamara Ilić SV45/2020 group 3
- Uroš Poček SV57/2020 group 3
## Assistant:
- Branislav Andjelic
## Problem being solved:
…
-
I ran the third step of PPO training; it was time-consuming and unstable. The reward observed during training ranges between -300 and -10, as follows. Is this normal? What does a good PPO trainin…
-
Hello, as a newcomer to reinforcement learning, I have some questions I would like to ask.
1. Are the hyperparameters in the model suitable for all scenarios?
2. When I was training the model with y…
-
Hello everyone,
I think I've finally figured out how to send a simple float feature vector to a TensorFlow model, run some predictions on it, and get the result back. It doesn't seem to be very …
-
Ensemble Kalman Filter (EnKF) for Reinforcement Learning (RL). (arXiv:2107.01244v1 [eess.SY])
https://ift.tt/2SUDfTO
This paper is concerned with the problem of representing and learning the optimal c…