Potentially useful pretrained models

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

MIT License

2.33k stars 249 forks source link

Potentially useful pretrained models #169

Closed gkucsko closed 1 year ago

gkucsko commented 1 year ago

Hey friends, we recently release some text-to-audio models that are very similar to the great work you all are doing here. Maybe some of them can be used as a starting point for finetuning within this repo (esp semantic and coarse/fine). https://github.com/suno-ai/bark

lucidrains commented 1 year ago

@gkucsko nice results! this confirms that Encodec + AudioLM scheme should work well together

feel free to repost this to discussions, as this is not an issue

gkucsko commented 1 year ago

good point, thanks for all the work here!