Closed CSU-NXY closed 1 year ago
That's pretty weird, and shouldn't happen. What are you running? Did you modify the code?
I'm running the pinecone dataset using train_360.sh. The only changes I made are setting config.factor to 8 and config.batch_size to 1024 in 360.gin.
I'm wondering whether the NaN comes from the sanity checking step in Jax, because the output looks good after the very first forward step.
Hi, thanks for your great work. I noticed that the weights would be NaN after the first sampling, however the training pipeline does not broken and the PSNR is growing. I'm wondering why this happened and how do you deal with NaN weights?![image](https://user-images.githubusercontent.com/19389655/185566220-c6b2dd19-e6fd-4fbf-bdd1-38593fd4fd64.png)