SteveImmanuel closed this 1 year ago
Thank you for your interest in our paper.
Thank you for your clarification. One small follow-up on number 2: do the 14,400 scenes come from a single episode or from multiple episodes? By a single episode, I mean from the starting state (the agent starts to move) until the end state (task achieved/failed).
We got the data from 120 episodes (120 timesteps per episode). Thank you.
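As a quick sanity check, the counts in this thread multiply out consistently. The sketch below assumes one scene per timestep, which is the natural reading of the reply above but not something it states outright. The variable names are illustrative and not taken from the paper's code.

```python
# Counts quoted in this thread.
episodes = 120
timesteps_per_episode = 120
views_per_scene = 3

# Assumption: each timestep yields exactly one scene.
scenes = episodes * timesteps_per_episode
images = scenes * views_per_scene

print(scenes)  # 14400
print(images)  # 43200
```

Under that assumption the 14,400 scenes correspond to 43,200 individual images. How the episodes are divided among the environments is not stated here.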
Thank you
Hi, nice work on the paper.
I have some questions about the NeRF pre-training (stage 1). In your paper, you mentioned that the offline datasets consist of 14,400 scenes, where each scene consists of 3 images from different views. You also use 4 different environments, e.g. window-open-v2, soccer-v2, hammer-v2, drawer-open-v2. Could you please elaborate on the following:
Thank you.