georghess / voxel-mae

Code for the paper "Masked Autoencoders for Self-Supervised Learning on Automotive Point Clouds"
Apache License 2.0
74 stars 6 forks source link

Question about masked car #2

Closed sky-fly97 closed 2 years ago

sky-fly97 commented 2 years ago

Hello, it's a really great job. I have two question: 1) if you take most of the point cloud of a car in the scene, then can you predict the car? 2) The first task is to reconstruct a fixed number of point coordinates in every voxel, which is equivalent to getting the coordinates of so many points without any priors, and only by chamfer distance loss?

georghess commented 2 years ago

Hi, Thanks for your interest!

1) I am not sure I understand the question. By "predict the car" do you mean predict its location or predict the points corresponding to the car? For the pre-training, we remove most of the point cloud and then ask the network to reconstruct the masked parts. For instance, this could be to reconstruct the point cloud of a car given only parts of the car. For the fine-tuning, we train the model on the entire point cloud and supervise it with regular 3DOD.

2) The prior for the points are their corresponding voxel centers, e.g., the network predicts the offset for the points within each voxel.