Many thanks for the great work and the nice open-source.
During trying your code, I noticed that the embeding diomension of the point cloud encoder is set to be 1024, which is to match the dimension of the EVA-CLIP. However, I would like to try for some smaller versions of CLIP(as also shown in your paper) which only supports 512 dimention, is it possible to provide the pretrained Uni3D with 512 embedding dimension?
Dear Author,
Many thanks for the great work and the nice open-source.
During trying your code, I noticed that the embeding diomension of the point cloud encoder is set to be 1024, which is to match the dimension of the EVA-CLIP. However, I would like to try for some smaller versions of CLIP(as also shown in your paper) which only supports 512 dimention, is it possible to provide the pretrained Uni3D with 512 embedding dimension?
Thanks for the reply in advance.