Training Data - How to use?

@wenbowen123 Thanks for your great work! I'm trying to fine-tune the Refinement and Selection models to use RGB input only using the training data you provided.

It appears the training dataset consists of pairs of renders of the same scene with bbox, K, distance to image plane, instance segmentation, and RGB render. Do you utilize one of the pair as the rendering and the other as the input observation for the pose refinement training?

It would be great if you could describe how the training dataset you provide is used.

NVlabs / FoundationPose

Training Data - How to use? #99