NVlabs / RVT

Official Code for RVT-2 and RVT
https://robotic-view-transformer-2.github.io/

RVT-2 model fails to converge with real-world data for a simple task #54

Closed XiaohanLei closed 3 weeks ago

XiaohanLei commented 3 weeks ago


Problem Description

I'm attempting to train an RVT-2 model for a simple task: "lift the block". I've collected 10 real-world demonstrations for training, but the model shows no sign of converging.

Environment

Attempts

So far, I've only tried training with the 10 collected samples.

Questions

  1. Is this issue primarily due to insufficient data?
  2. What other potential reasons could be causing the model to fail to converge?
  3. For such a simple task, approximately how many samples might be needed to see convergence?
  4. Are there any suggestions to improve the training process or data collection method?

Additional Information

Two images are attached: the first is the point cloud, and the second is the rendered result.

Any help or advice would be greatly appreciated!

imankgoyal commented 3 weeks ago

Hi,

Thanks for your interest in our work. It seems like you are unable to fit on the training data.

XiaohanLei commented 3 weeks ago

I found that the cause is my dataset being too small: the cosine learning rate schedule had not raised the learning rate much before training completed. In other words, the learning rate was too low. Thank you for your kind response.
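
For anyone hitting the same symptom, here is a minimal sketch of how a short run can stay stuck at a low learning rate. It assumes a linear warmup followed by cosine decay, which is a common pattern and not necessarily the exact scheduler RVT-2 uses. The function names and all numbers (`peak_lr`, `warmup_steps`, `total_steps`) are hypothetical and chosen only for illustration.

```python
import math


def lr_at_step(step, peak_lr, warmup_steps, total_steps):
    """Learning rate at `step`: linear warmup to peak_lr, then cosine decay to 0.

    Hypothetical schedule for illustration; not taken from the RVT code base.
    """
    if step < warmup_steps:
        # Warmup phase: the rate grows linearly from 0 toward peak_lr.
        return peak_lr * step / warmup_steps
    # Cosine phase: decay from peak_lr to 0 over the remaining steps.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return 0.5 * peak_lr * (1.0 + math.cos(math.pi * min(1.0, progress)))


def peak_lr_seen(peak_lr, warmup_steps, total_steps):
    """Largest learning rate actually reached during a run of total_steps."""
    return max(
        lr_at_step(s, peak_lr, warmup_steps, total_steps)
        for s in range(total_steps)
    )


if __name__ == "__main__":
    peak_lr = 1e-4        # assumed target learning rate
    warmup_steps = 2000   # assumed fixed warmup length

    # A tiny dataset gives few optimizer steps, so training ends mid-warmup.
    short_run = peak_lr_seen(peak_lr, warmup_steps, total_steps=200)
    # A longer run gets past warmup and reaches the full rate.
    long_run = peak_lr_seen(peak_lr, warmup_steps, total_steps=20000)

    print(f"short run peak lr: {short_run:.3e}")
    print(f"long run peak lr:  {long_run:.3e}")
```

With these assumed numbers, the short run ends during warmup and only ever sees about a tenth of the target rate. If the total number of training steps depends on dataset size, plausible remedies are training for more epochs, shortening the warmup, or raising the peak learning rate, so that the schedule reaches a useful rate.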