thu-ml / CRM

[ECCV 2024] Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.
https://ml.cs.tsinghua.edu.cn/~zhengyi/CRM/
MIT License
546 stars 44 forks source link

Is there any plan to release training script? #3

Open f1shel opened 6 months ago

f1shel commented 6 months ago

Hi, thanks for your wonderful paper and the released inference code!!! I am wondering if there is any plan to release the training script. Your reply will be highly appreciated~

SanketDhuri commented 6 months ago

I attempted to develop training for this, but the outcomes aren’t meeting my expectations (my goal was to overfit it for a single object, such as a shoe). Below given repo contains the code for training the model. This code is specifically designed to train 3D geometry, as I am deliberately inputting RGB images (manually captured) and CCM images (previously generated).

Repo link

please contribute to it if it has any mistakes / bug found

thuwzy commented 6 months ago

Thank you for your interest! The training code for the CRM is quite extensive, and it's difficult to organize it all in a short period. I will start by gradually uploading some of the utility functions I have organized.

SanketDhuri commented 6 months ago

Thank you for your interest! The training code for the CRM is quite extensive, and it's difficult to organize it all in a short period. I will start by gradually uploading some of the utility functions I have organized.

@thuwzy can you take a quick look to the training I have uploaded and plz identify mistakes I am making since I am not getting result as expected

thuwzy commented 6 months ago

Thank you for your interest! The training code for the CRM is quite extensive, and it's difficult to organize it all in a short period. I will start by gradually uploading some of the utility functions I have organized.

@thuwzy can you take a quick look to the training I have uploaded and plz identify mistakes I am making since I am not getting result as expected

Hi Sanket, I have read your code and found some problems. (1) the sdf should not be randomly initialized. Instead, it should be got from the output triplanes from the sdfMLP. (2) The learning rate is too high. I suggest 1e-4.