reinforcement-learning-environments Search Results

812 results
for reinforcement-learning-environments


game-devs-csf/RL-environments #23

Finish the football game

- [ ] Define the collisions that must occur between the players and the ball. - [ ] Create an Env for learning through self-play.

BassedWarrior updated 5 months ago
1
modumarl/proposal #5

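20180417 meeting minutes

MultiAgent RL ## Problem setup - Cooperation => chase - The chasers are MARL - The runners are rule-based - combat? A fighting algorithm? - Evaluation? Rule-based vs. MARL: catching agents is easier. Football is not a suitable scenario. Passing, at most... ## Training methodology - centralized - de…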

jahyun-dev updated 6 years ago
2
ldoshi/rome-wasnt-built-in-a-day #213

Investigate epsilon and sweep hyperparameters for DQN

Trying to debug larger width environments (7 currently). Things to try: 1. Different metric (Average Q-value from 2013 paper https://arxiv.org/pdf/1312.5602.pdf). ``` 5.1 Training and Sta…

josephmaa updated 1 day ago
50
tillwenke/sumo_variable_speed_limits #4

Reinforcement learning models code not understandable

Please add some comments to the definitions of the reinforcement learning models so that readers can understand why they were designed this way. Checklist - [X] Modify `rl_gym_environments.py` ✓ https://github.co…

tillwenke updated 7 months ago
1
huggingface/deep-rl-class #394

Translating to Russian

Hi! Let's bring the reinforcement learning course to all the Russian-speaking community 🌏 Would you want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/tran…

blademoon updated 8 months ago
53
Farama-Foundation/HighwayEnv #583

custom environment

Hello! Can I control all the background vehicles myself, for example, by using a prediction model to output the trajectories of background vehicles, while the main vehicle uses IDM?

lihua12312 updated 5 months ago
3
huggingface/deep-rl-class #370

🌐 [i18n-KO] Translating rl-course to Korean

Hi! Let's bring the reinforcement learning course to all the Korean-speaking community 🌏 (currently 9 out of 77 complete) Would you want to translate? Please follow the 🤗 [TRANSLATING guide](ht…

wonhyeongseo updated 1 year ago
1
openai/procgen #90

invalid render mode human can not render"human" and "rgb_arr…

invalid render mode human File "C:\Users\hasee\AppData\Local\Programs\Python\python-3.9.13\Lib\site-packages\procgen\env.py", line 105, in __init__ raise Exception(f"invalid render mode {rende…

xiezhipeng-git updated 8 months ago
1
dennybritz/reinforcement-learning #252

MC Control with Epsilon-Greedy Policies ---Epsilon Value and…

- The epsilon value is not decreased hyperbolically. At the end of each episode, there should be epsilon = epsilon/1.1

hardik-kansal updated 2 months ago
2
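
The per-episode update proposed in dennybritz/reinforcement-learning #252 can be sketched as follows. This is a minimal illustration, not code from that repository: the function names, the `factor` parameter, and the starting value are hypothetical. Note that dividing by a constant each episode gives geometric decay; a 1/k schedule is included for comparison, on the assumption that this is what "hyperbolically" might refer to.

```python
def geometric_epsilon(initial_epsilon: float, episode: int, factor: float = 1.1) -> float:
    """Epsilon after `episode` completed episodes when it is divided by
    `factor` at the end of each episode (epsilon = epsilon / 1.1)."""
    return initial_epsilon / (factor ** episode)


def hyperbolic_epsilon(initial_epsilon: float, episode: int) -> float:
    """Assumed alternative: epsilon shrinks as 1 / (episode + 1)."""
    return initial_epsilon / (episode + 1)


def run_schedule(initial_epsilon: float, num_episodes: int, factor: float = 1.1) -> list:
    """Apply the end-of-episode update iteratively and record the epsilon
    that was used during each episode."""
    epsilon = initial_epsilon
    used = []
    for _ in range(num_episodes):
        used.append(epsilon)        # epsilon in effect for this episode
        epsilon = epsilon / factor  # update at the end of the episode
    return used
```

Calling `run_schedule(1.0, 3)` returns the epsilon values in effect for the first three episodes; the closed-form `geometric_epsilon` gives the same values without the loop.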
irthomasthomas/undecidability #731

LlamaGym: Online Reinforcement Learning for LLM-based agents…

- [ ] [LlamaGym/README.md at main · KhoomeiK/LlamaGym](https://github.com/KhoomeiK/LlamaGym/blob/main/README.md?plain=1) # LlamaGym/README.md at main · KhoomeiK/LlamaGym DESCRIPTION: Fine-tune LL…

irthomasthomas updated 6 months ago
1
