Open pranavmalikk opened 1 year ago
Hi @pranavmalikk,
We did not include WaveNet results in the paper because we found it very difficult to arrive at a fair setup. There are many choices we would need to make before we set up such a comparison:
If you can be specific about these questions, I am happy to run an ablation and post the results here.
The paper states: "Our model resembles WaveNet (Oord et al., 2016a) in the use of tree-structured dilated convolutions. However, our principle-guided design has distinct skip-connection structures and filter sharing patterns, resulting in significantly better parameter efficiency and performance... Additionally, the link we establish between wavelets and tree-structured dilated causal convolutions offers the first principled justification for the effectiveness of WaveNet in modeling raw audio waveforms, an exemplary case of lengthy sequences with multiscale structure."
Do you have any ablations showing the performance difference against WaveNet on specific tasks or tests? Do you also have any audio samples? Overall, a very interesting paper!
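For anyone following the thread, the "tree-structured dilated causal convolutions" in the quoted passage can be illustrated with a minimal pure-Python sketch. This is neither the paper's model nor WaveNet itself: the function names, the kernel size of 2, the all-ones weights, and the dilation schedule 1, 2, 4, 8 are assumptions chosen only to show how the receptive field doubles with each layer.

```python
def causal_dilated_conv(x, weights, dilation):
    """Compute y[t] = sum_k weights[k] * x[t - k * dilation].

    Samples before t = 0 are treated as zero, so y[t] never depends
    on future inputs (the "causal" part).
    """
    out = []
    for t in range(len(x)):
        acc = 0.0
        for k, w in enumerate(weights):
            idx = t - k * dilation
            if idx >= 0:
                acc += w * x[idx]
        out.append(acc)
    return out


def stack(x, weights, dilations):
    """Apply one causal dilated convolution per dilation, in order."""
    for d in dilations:
        x = causal_dilated_conv(x, weights, d)
    return x


def receptive_field(kernel_size, dilations):
    """Number of past-and-present input samples one output can see."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)


# Hypothetical configuration: kernel size 2, dilations doubling per layer.
dilations = [1, 2, 4, 8]
weights = [1.0, 1.0]  # illustrative weights, not learned filters

# Feed a unit impulse and see how far its influence spreads.
impulse = [1.0] + [0.0] * 19
response = stack(impulse, weights, dilations)

print("receptive field:", receptive_field(len(weights), dilations))
print("impulse response:", response)
```

With these illustrative weights, the impulse reaches each output inside the receptive field along exactly one path through the layers, which is the binary-tree structure the quote refers to. Four layers cover 16 samples, whereas an undilated stack with the same kernel size would need 15 layers.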