SoundStream improved reimplementation

Thanks for publishing this! In the encodec paper you write

For fair evaluation, we also compare EnCodec to our reimplementation of SoundStream (Zeghidour et al., 2021). [...] Finally, we compare EnCodec to the SoundStream model from the official implementation available in Lyra 2 1 at 3.2 kbps and 6 kbps on audio upsampled to 32 kHz. We also reproduced a version of SoundStream (Zeghidour et al., 2021) with minor improvements. Namely, we use the relative feature loss introduce in Section 3.4, and layer normalization (applied separately for each time step) in the discriminators, except for the first and last layer, which improved the audio quality during our preliminary studies.

And on https://ai.honu.io/papers/encodec/samples.html you show samples of this reimplementation. Could you share the source code of your SoundStream reimplementation so this work can be reproduced?

facebookresearch / encodec

SoundStream improved reimplementation #3