eureka-research / Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
https://eureka-research.github.io/
MIT License
2.73k stars 244 forks source link

thoughts and follow-up questions after paper read #32

Open ramkumarkoppu opened 7 months ago

ramkumarkoppu commented 7 months ago

Hi, Today I was reading your paper, and have few questions to ask:

  1. model is frozen after training by the Eureka (along with it's GPT-4 API), when you deploy this model on the robot, does it need access to GPT-4 API to do inference and change it's behavior as real world environment changes?
  2. How do we incorporate explainability into Eureka if we want to understand why specific specific policy chosen?
  3. How easy it is incorporate robot multi sensors data in robot perception for affect it's actions?
  4. How easy it is to scale Eureka for Co-operative Multi agent (robot) systems?