Question about masked car

georghess / voxel-mae

Code for the paper "Masked Autoencoders for Self-Supervised Learning on Automotive Point Clouds"

Apache License 2.0

74 stars 6 forks source link

Hi, Thanks for your interest!

1) I am not sure I understand the question. By "predict the car" do you mean predict its location or predict the points corresponding to the car? For the pre-training, we remove most of the point cloud and then ask the network to reconstruct the masked parts. For instance, this could be to reconstruct the point cloud of a car given only parts of the car. For the fine-tuning, we train the model on the entire point cloud and supervise it with regular 3DOD.

2) The prior for the points are their corresponding voxel centers, e.g., the network predicts the offset for the points within each voxel.

georghess / voxel-mae

Question about masked car #2