Additional config details from the hugging face checkpoints

ZhikangNiu / encodec-pytorch

unofficial implementation of the High Fidelity Neural Audio Compression

MIT License

131 stars 13 forks source link

Hi there,

Thanks a bunch for your effort on that. This is fantastic work.

I was wondering if you could provide a bit more details about the configuration you've used in training the checkpoints you've provided on hugging face ? They sound great and I'd like to re-train them for my own purpose. From their file names, I can infer the following: batch_size=12, tensor_cut=100000, and lr=0.0001, is this right ? What about warmup_epoch, for example ? Additionally, did you use only a subset of LibriTTS or the full 960 hours ?

Thanks again !

ZhikangNiu / encodec-pytorch

Additional config details from the hugging face checkpoints #17