llan-ml / maml-on-ray

3 stars 1 forks source link

Maml implementation #1

Open heraldiclily opened 5 years ago

heraldiclily commented 5 years ago

hello @llan-ml

Thank you for sharing this Maml implementation on ray. Have you been able to fix the raygeterror issue with ray support team? I will be gratefull if you could enrich this page with some implementation instructions.

llan-ml commented 5 years ago

I am using https://github.com/ray-project/ray/commit/20c7fad4f4426582da89765c607c487c875bc7ab now, and it works well without RayGetError. But the RLlib codes of this commit are not compatible with this codebase, and it should be easy to fix if you are familiar with RLlib codes (most cases are that APIs add new arguments).

As for Carla, I do not have Carla installed, and thus I cannot test with the Carla example. If you encounter any errors/bugs, feel free to ask.

smiler80 commented 5 years ago

hello @llan-ml,

Thank you for the shared code. Is it specific to some environments or generic? I would like to apply maml to gym env but the available codes are generally customized to certain application (halfCheetah, mountain car...). I went through your code and found no environment name, to which task did you apply it successfully?

llan-ml commented 5 years ago

hi @bbacem80 The code is developed based on Ray and RLlib.

We need to define a env_creator, register it with Ray, https://github.com/llan-ml/maml-on-ray/blob/6e58b1a65fca4b278d26f994b139fa3b233a91ac/main.py#L21-L22

The run_experiments will automatically pass the env name to the __init__ of the agent, and then the env name is used to construct PolicyEvaluator.

In principle, the MAML code is suitable to any environments as long as the environment conforms to RLlib design.

smiler80 commented 5 years ago

Thank you @llan-ml again for your explanations.

Since I'm beginner in RLLIB toolkit, and while I'm deepening my knowledge on the subject, I would be grateful if you can provide initiating instructions on how to run your code maml with a Gym environment, i.e. after the git clone of ray-project:

llan-ml commented 5 years ago

hi @bbacem80

The code in this repo was just used for debugging a ray bug before, and is obsolete now since the interfaces of RLlib have changed. Definitely, directing running the code will raise some errors (most cases are that new APIs use some new arguments).

Later, I will clean up my local codebase and upload a new version, but you can try this now.