Closed suvigy closed 4 months ago
I noticed this occurred sometimes when training past 30k iterations for some reason. Can you try training the nerfacto to just 20k iterations, and then continuing with in2n on that checkpoint? For some reason that tends to just fix the issue. Let me know if that does not work
I tried it with 20k iterations but still it gives me Nan-s already at the beginningI also tried to lower the lpips loss multiplicator, but still resulting NaN-s at the beginning (on the gt paches).
This is how I try. My scene does not have object centric camera path, rather backward driving. I turned off camera optimization, because colmap path was quite pixel accurate, turning it on just screws up the camera positions.
ns-train in2n --data <my transforms.json> --output-dir <output dir> --load-dir <trained nerfacto model dir> --pipeline.model.camera-optimizer.mode=off --pipeline.prompt "<my prompt>" --pipeline.guidance-scale 7.5 --pipeline.image-guidance-scale 1.5 nerfstudio-data --downscale-factor 4
I optionally tried: --pipeline.model.lpips-loss-mult 0.4
but stull it NaN-s
Also tried fields learning rate warmup and lower the learning rate (--optimizers.fields.optimizer.lr=0.005 --optimizers.fields.scheduler.warmup-steps=1000
) but didn't help
I see. Unfortunately it might be just a weird case of your specific dataset, as I do remember running into some of these issues. Once solution would be to just set the LPIPS loss to be off, and the results should still be pretty decent.
Thanks for you work. I try to use in2n with a trained nerfacto model. But I'm getting NaN-s when using lpips.
I checked a bit, and I could see the gt_patches will contain NaN-s, at the beginning of the training sometimes, but then populates, and after some steps all the gt_patches will contain NaN-s.
Maybe could give me some hint what the reason could be? I don't use cameraoptimizer and but use masks. First I thought maybe it is because of the masks, but since the NaN-s are populating, I guess the reason is different. Image resolution is: 480x320 (closest to 512x512)