-
- [ ] Define the collisions that must occur between the players and the ball.
- [ ] Create an Env for learning through self-play.
-
MultiAgent RL
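## Problem setup
- Cooperation => chase
- The chasers use MARL
- The evaders are rule-based
- combat? Fighting algorithm?
- Evaluation? Rule-based vs. MARL agents
Catching is easier
Soccer is not a suitable scenario. Passing at most...
## Training
Methodology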
- centralized
- de…
-
Trying to debug larger-width environments (currently 7).
Things to try:
1. A different metric (average Q-value, from the 2013 DQN paper https://arxiv.org/pdf/1312.5602.pdf).
```
5.1 Training and Sta…
-
Please add comments to the definitions of the reinforcement learning models that explain why they were designed this way.
Checklist
- [X] Modify `rl_gym_environments.py` ✓ https://github.co…
-
Hi!
Let's bring the reinforcement learning course to the whole Russian-speaking community 🌏
Would you want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/tran…
-
Hello! Can I control all the background vehicles myself, for example, by using a prediction model to output the trajectories of background vehicles, while the main vehicle uses IDM?
-
Hi!
Let's bring the reinforcement learning course to the whole Korean-speaking community 🌏 (currently 9 out of 77 complete)
Would you want to translate? Please follow the 🤗 [TRANSLATING guide](ht…
-
invalid render mode human
File "C:\Users\hasee\AppData\Local\Programs\Python\python-3.9.13\Lib\site-packages\procgen\env.py", line 105, in __init__
raise Exception(f"invalid render mode {rende…
-
- Epsilon value is not decreased hyperbolically.
At the end of each episode, there should be `epsilon = epsilon / 1.1`.
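
A minimal sketch of the per-episode update requested above, assuming epsilon is a plain float that the training loop owns. The names `decay_epsilon` and `epsilon_schedule` and the `min_epsilon` floor are hypothetical and not taken from the repository. Note that dividing by a constant every episode gives geometric decay, `epsilon_0 / 1.1**n` after `n` episodes.

```python
def decay_epsilon(epsilon, factor=1.1, min_epsilon=0.0):
    """Apply one end-of-episode update: epsilon = epsilon / factor.

    `min_epsilon` is a hypothetical floor (not in the original request)
    that keeps a small amount of exploration if set above zero.
    """
    return max(min_epsilon, epsilon / factor)


def epsilon_schedule(start, episodes, factor=1.1, min_epsilon=0.0):
    """Return the epsilon used in each episode, starting from `start`."""
    schedule = []
    epsilon = start
    for _ in range(episodes):
        schedule.append(epsilon)  # value used during this episode
        epsilon = decay_epsilon(epsilon, factor, min_epsilon)  # update at episode end
    return schedule


if __name__ == "__main__":
    for episode, eps in enumerate(epsilon_schedule(1.0, 5)):
        print(f"episode {episode}: epsilon = {eps:.4f}")
```

In a training loop, the call to `decay_epsilon` would sit after the inner step loop finishes, so the value changes once per episode rather than once per step.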
-
- [ ] [LlamaGym/README.md at main · KhoomeiK/LlamaGym](https://github.com/KhoomeiK/LlamaGym/blob/main/README.md?plain=1)
# LlamaGym/README.md at main · KhoomeiK/LlamaGym
DESCRIPTION:
Fine-tune LL…