gridworld-environment Search Results

204 results
for gridworld-environment

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Unity-Technologies/ml-agents #4992

Unexpected getSteps Behavior with Python API

**Describe the bug** I'm getting weird obs back. Sometimes I'm getting only 1 TerminalSteps und 0 DecisionSteps, sometimes 2 TS and 0 DS and the even worse part, I'm getting sometimes 12 (when having…

PeterKeffer updated 2 years ago
6
salesforce/warp-drive #31

Error: Invalid Resource handle.

Hello WarpDrive Team, A good MARL library indeed. I have tried this library on an old machine and it works fine. However, when I moved to a new machine, I met the following error. ``` (warp_…

Ma-Weijian updated 2 years ago
2
MushroomRL/mushroom-rl #86

Categorical Policy for Discrete Action Spaces?

I want to explore policy gradient and actor critic agents on `GridWorld` environments. To that end, I want to parameterize the policy as a Categorical distribution at each state. How do I do this? …

RylanSchaeffer updated 2 years ago
9
HumanCompatibleAI/overcooked_ai #42

Add Dynamic Recipes

Update the `OvercookedState` and `OvercookedGridworld` classes to include recipe lists that can change with time. This would involve adding timestep dependencies to `all_orders` and `bonus_orders`

nathan-miller23 updated 2 years ago
4
MushroomRL/mushroom-rl #87

Tutorial for REINFORCE

I'm trying to implement a simple REINFORCE agent on `Gridworld`. However, I keep hitting the following error: ``` File "/home/rylan/Documents/GanguliGang-Metacognitive-Actor-Critic/mac_venv/lib/…

RylanSchaeffer updated 2 years ago
2
mashimashica/WM2021_LWM #6

2Dゲーム環境の再現

mashimashica updated 2 years ago
7
lmzintgraf/varibad #9

the policy perform bad in my own env

I'm using the code for my own env which has no time limit but has max episode step limit. and my best action maybe about -0.5. But I found that my action would beyound the limitation [-1,1] a lot. I…

wenzhoulyu updated 2 years ago
3
salesforce/warp-drive #10

Some question about the environment

I think the idea of environment scheduling is very novel. Multi-environment and multi-agent are scheduled on GPU, which improves GPU utilization ratio. I have some questions about the `tag-continuous…

ghost updated 2 years ago
3
rlberry-py/rlberry #84

a_idx2str up and down inverted in GridWorld ?

Hi, I found something weird in the controls in GridWorld. It seems like up and down are inverted: I used the first cells of the Google Colab tutorial in Google Colab: ```Python from IPython import …

theovincent updated 2 years ago
1
ray-project/ray #20112

Policy Mapping Function based on Environment Observation

Is it possible to create an agent which uses different policies depending on an observation? For example, in a hypothetical Windy Gridworld environment where the wind can change direction spontaneousl…

leeykang updated 3 years ago
2

上一页 1...6 7 8 9 10 11 12...21 下一页

204 results for gridworld-environment

204 results
for gridworld-environment