ken2576 / vision-nerf

Official PyTorch Implementation of paper "Vision Transformer for NeRF-Based View Synthesis from a Single Input Image", WACV 2023.
MIT License
105 stars, 12 forks

question about training #21

Open crazy-stycxj opened 1 month ago

crazy-stycxj commented 1 month ago

Hi! I repeated the training process and got the loss shown below. Is this normal? The loss seems to keep oscillating all the way to the end of training. The whole run took 52 hours on an RTX 3090 (24 GB). Also, could you share the code for computing the evaluation metrics on the test set? Thanks!

image

ken2576 commented 1 month ago

I think it looks OK. With this kind of dataset, the loss usually fluctuates heavily because the amount of unseen area differs from sample to sample. The eval script should be here: https://github.com/ken2576/vision-nerf/blob/main/eval.py
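One way to check whether a heavily fluctuating loss is still trending downward is to smooth the logged values before plotting. The snippet below is a minimal sketch of a bias-corrected exponential moving average. It is not part of the vision-nerf repository; the function name `ema_smooth`, the smoothing factor `alpha`, and the synthetic loss values are all assumptions made for illustration.

```python
import random


def ema_smooth(values, alpha=0.98):
    """Return a bias-corrected exponential moving average of `values`.

    `alpha` close to 1.0 means heavier smoothing. (Hypothetical helper,
    not taken from the vision-nerf code.)
    """
    smoothed = []
    running = 0.0
    for step, value in enumerate(values, start=1):
        running = alpha * running + (1.0 - alpha) * value
        # Divide by (1 - alpha**step) so early steps are not biased toward 0.
        smoothed.append(running / (1.0 - alpha ** step))
    return smoothed


# Synthetic noisy loss: a slowly decaying trend plus per-sample noise.
rng = random.Random(0)
raw_loss = [1.0 / (1.0 + 0.01 * i) + rng.uniform(-0.2, 0.2) for i in range(2000)]
smooth_loss = ema_smooth(raw_loss)

print(f"raw loss at last step:      {raw_loss[-1]:.4f}")
print(f"smoothed loss at last step: {smooth_loss[-1]:.4f}")
```

If the smoothed curve keeps decreasing or flattens out while the raw curve jumps around, the fluctuation is most likely per-sample noise rather than a sign of diverging training.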

crazy-stycxj commented 1 month ago

OK, thanks. I ran eval.py, but it only outputs images, not the evaluation metrics. I mean, how do I compute PSNR, LPIPS, and SSIM, like this?

crazy-stycxj commented 1 month ago

I have one more question. You said in your paper that the decoder uses regular convolution, but I couldn't find the corresponding part. Can you point me to it in the code? Looking forward to your reply, thanks!

ken2576 commented 1 month ago

Try the script from PixelNeRF to calculate the metrics. You may need to change the paths to match the outputs. https://github.com/sxyu/pixel-nerf/blob/master/eval/calc_metrics.py
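As a quick sanity check before wiring up the full script, PSNR can be computed directly from the mean squared error between a rendered image and its ground truth. The snippet below is a minimal sketch, not the PixelNeRF `calc_metrics.py` implementation. The function name `psnr` and the toy pixel values are assumptions, and it expects flattened pixel values scaled to [0, `max_val`]. SSIM and LPIPS are not covered here, since they depend on image structure and a learned network respectively.

```python
import math


def psnr(pred, target, max_val=1.0):
    """PSNR in dB between two equal-length flat sequences of pixel values.

    Hypothetical helper for illustration; pixel values are assumed to lie
    in the range [0, max_val].
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must contain the same number of values")
    mse = sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_val ** 2 / mse)


# Toy example with four "pixels" in [0, 1].
target_pixels = [0.0, 0.5, 1.0, 0.25]
pred_pixels = [0.1, 0.5, 0.9, 0.25]
print(f"PSNR: {psnr(pred_pixels, target_pixels):.2f} dB")
```

For 8-bit images stored as integers in 0 to 255, pass `max_val=255`, or divide the pixel values by 255 first.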

I think the conv is defined here. https://github.com/ken2576/vision-nerf/blob/main/network/vit.py#L468