I have a few questions regarding the code and paper:
In the repository, I cannot find the code for the output residual connection of the hypernetwork, can you point it out?
In the "Output Encoding" part of the paper, you introduce a set of learnable parameters. So will your training of hypernetwork + primary network will have a parameter count slightly larger than traditional training of primary network only?
Hi, thank you for your interesting research.
I have a few questions regarding the code and paper: