Fix Pathmind Simulation

PathmindAI / nativerl

Train reinforcement learning agents using AnyLogic or Python-based simulations

Apache License 2.0

19 stars 4 forks source link

Fix Pathmind Simulation #425

Closed kepricon closed 2 years ago

kepricon commented 2 years ago

I tested with Ed's model(added getRewardTerms() into Mdoe's Simulation.py) model_py_simulation.zip

curl -i -XPOST -H "X-PM-API-TOKEN: 11202253-5709-4eb7-9102-f87122314464" -F 'file=@/home/kepricon/Downloads/python_examples.zip' -F 'projectId=500' -F 'env=examples.mouse.single_agent_mouse_env.MouseAndCheese' -F 'start=TRUE' http://localhost:8081/py/upload

Here are Training results

https://s3.console.aws.amazon.com/s3/buckets/dh-training-dynamic-files.pathmind.com?region=us-east-1&prefix=id3086/output/&showversions=false

slinlee commented 2 years ago

@ejunprung the single agent was working already, right? we need the multi agent to work now?

ejunprung commented 2 years ago

@slinlee It's not related to single vs multi-agent. I was testing the gym environment versus Pathmind simulation environment. I just happened to have the multi-mouse example ready to go at the time so I was using that for testing. It should still work fine, I haven't see any issues around multi-agent so far.

slinlee commented 2 years ago

@ejunprung so what works right now?

gym env - works?
pathmind simulations - works?

ejunprung commented 2 years ago

Still figuring that out. Need another day or two to finish testing so I'll compile a list after that. But so far, gym works fine. Pathmind simulations is broken but I still need to test Dae's fix.

Multi-agent mechanically should work but it's missing features (e.g. skip) so it won't be usable in practice.

slinlee commented 2 years ago

k. yeah nice, def keep a list of the small things that need to be added

slinlee commented 2 years ago

run tests

ejunprung commented 2 years ago

@slinlee I think we need to add this fix to test environment. I still can't get my local py-nativerl working correctly. Do I need to build a new NativeRL or is that automated now?