PaulDaoudi / FOOD

Reimplementation of the FOOD algorithm in the Off-Dynamics Reinforcement Learning setting.
MIT License
1 stars 0 forks source link

Instruction to run the experiments #1

Open linhlpv opened 3 weeks ago

linhlpv commented 3 weeks ago

Hi @PaulDaoudi ,

Thank you for your amazing works! I'm trying to run your code and reproduce your experiments in the FOOD's paper. I have looked at readme file but couldn't find any instruction about it. Could you please help me and give some instruction to run the experiments and get FOOD's results.

Thank you so much and have a good day. Best. Linh

PaulDaoudi commented 3 weeks ago

Hi Linh,

I am glad that you took interest in our paper!

First, please note that the code is not entirely finished: some (minors) errors of my FOOD reimplementation (closely following the original code) must be corrected. It should be finished in the following weeks.

Second, the original trained policies are no longer available, but you can recreate them to gather the target trajectories and obtain the results. Here are the steps to follow:

I will keep you updated when the code will be entirely finished.

Thanks again for your patience and interest in our work!

Best,

Paul

linhlpv commented 2 weeks ago

@PaulDaoudi ,

Thank you for your reply.

As you mentioned about the original code, may I ask you where I can find it? Second, about training the policy to gather the source trajectories, how can I train the policy? Do you provide any source code for training and gathering (filtering bad trajectories too) in your repo?

Thank you again and have a nice day. Best, Linh