Open sh-lee-prml opened 2 years ago
To compare a SIREN layer with a ReLU MLP, we implemented two models.
Model 1 (ReLU MLP): audio signal (B, T, 1) --> Linear(B, T, 128) + ReLU --> Linear(B, T, 128) + ReLU --> Linear(B, T, 128) + ReLU --> Linear(B, T, 1) --> reproduced signal (B, T, 1)
Model 2 (SIREN): audio signal (B, T, 1) --> SIREN layer --> SIREN layer --> SIREN layer --> Linear(B, T, 1) --> reproduced signal (B, T, 1)
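For reference, the two pipelines above could be sketched in PyTorch roughly as follows. This is only a minimal sketch under assumptions, not the code actually used here: the `SirenLayer` class, the helper names `make_relu_mlp` and `make_siren`, the SIREN hidden width of 128, `omega_0 = 30`, and the uniform weight initialization are all assumed for illustration.

```python
import math

import torch
from torch import nn


class SirenLayer(nn.Module):
    """Linear layer followed by sin(omega_0 * x). Hypothetical helper for this sketch."""

    def __init__(self, in_features, out_features, omega_0=30.0, is_first=False):
        super().__init__()
        self.omega_0 = omega_0  # assumed frequency scale, not taken from the post
        self.linear = nn.Linear(in_features, out_features)
        # Assumed init: uniform weights, with a narrower range for non-first layers.
        if is_first:
            bound = 1.0 / in_features
        else:
            bound = math.sqrt(6.0 / in_features) / omega_0
        with torch.no_grad():
            self.linear.weight.uniform_(-bound, bound)

    def forward(self, x):
        return torch.sin(self.omega_0 * self.linear(x))


def make_relu_mlp(hidden=128):
    # Model 1: three Linear + ReLU blocks, then a linear output layer.
    return nn.Sequential(
        nn.Linear(1, hidden), nn.ReLU(),
        nn.Linear(hidden, hidden), nn.ReLU(),
        nn.Linear(hidden, hidden), nn.ReLU(),
        nn.Linear(hidden, 1),
    )


def make_siren(hidden=128):
    # Model 2: three SIREN layers, then a linear output layer.
    return nn.Sequential(
        SirenLayer(1, hidden, is_first=True),
        SirenLayer(hidden, hidden),
        SirenLayer(hidden, hidden),
        nn.Linear(hidden, 1),
    )


# Stand-in for an audio batch of shape (B, T, 1); real audio would replace this.
x = torch.randn(2, 16, 1)
print(make_relu_mlp()(x).shape, make_siren()(x).shape)
```

Both models in this sketch take the audio signal itself as input and are applied independently at each time step, mirroring the setup described above.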
In your paper, the ReLU MLP is not able to reproduce the audio signal. However, the first model reproduces the audio signal even better than SIREN.
SIREN is also very unstable, so I used a lower learning rate, but the loss still fluctuated.
Could you explain why SIREN is better than other models for audio reproduction?
Hi, I have the same doubt. Have you figured out why this happens?