I found when fitting an image for SIREN, we only input a position encoding (doesn't contain the information of the image) and backdrop by comparing the output with the ground truth image. Does that mean we will input the same tensors (linear space from -1 to 1) when fitting different images? If so, the network parameters will be different for every image.
I found when fitting an image for SIREN, we only input a position encoding (doesn't contain the information of the image) and backdrop by comparing the output with the ground truth image. Does that mean we will input the same tensors (linear space from -1 to 1) when fitting different images? If so, the network parameters will be different for every image.