Closed liops-seungyeob closed 3 months ago
Since the input size of the Diffusion Policy is fixed, the output of the Point Cloud Encoder must also consistently return a fixed value.
It appears that the DP3 and PointNet encoders use Max Pooling to achieve this.
For other encoders, what methods were used to match the input size of the Diffusion Policy?
Hi, thank you for your interest. For other encoders, they all output a latent vector for downstream tasks, so it is similar.
Since the input size of the Diffusion Policy is fixed, the output of the Point Cloud Encoder must also consistently return a fixed value.
It appears that the DP3 and PointNet encoders use Max Pooling to achieve this.
For other encoders, what methods were used to match the input size of the Diffusion Policy?