Could this be used for audio synthesis?

hubertsiuzdak / snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

https://hubertsiuzdak.github.io/snac/

MIT License

Open MonolithFoundation opened 2 months ago

MonolithFoundation commented 2 months ago

For instance, could an LLM output SNAC tokens that are then decoded into audio?

itsliupeng commented 2 months ago

FYI, https://github.com/gpt-omni/mini-omni uses the SNAC codec to generate audio.
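
To make the idea concrete, here is a minimal sketch of the bookkeeping between an LLM's flat token stream and SNAC's multi-scale codes. It assumes three codebook levels whose rates double from coarse to fine (1, 2 and 4 codes per coarse frame); the real layout depends on the SNAC checkpoint, and the interleaving order here is hypothetical, not necessarily what mini-omni does. `flatten_codes` and `unflatten_codes` are illustrative names, not part of the snac package.

```python
from typing import List

# Assumption: three codebook levels with 1, 2 and 4 codes per coarse frame,
# so one frame occupies 1 + 2 + 4 = 7 positions in the flat stream.
TOKENS_PER_LEVEL = (1, 2, 4)
FRAME_SIZE = sum(TOKENS_PER_LEVEL)


def flatten_codes(levels: List[List[int]]) -> List[int]:
    """Interleave per-level code lists into one frame-ordered token stream."""
    if len(levels) != len(TOKENS_PER_LEVEL):
        raise ValueError("expected one code list per level")
    n_frames = len(levels[0]) // TOKENS_PER_LEVEL[0]
    for level, per_frame in zip(levels, TOKENS_PER_LEVEL):
        if len(level) != n_frames * per_frame:
            raise ValueError("level lengths do not match the assumed layout")
    flat: List[int] = []
    for frame in range(n_frames):
        for level, per_frame in zip(levels, TOKENS_PER_LEVEL):
            flat.extend(level[frame * per_frame:(frame + 1) * per_frame])
    return flat


def unflatten_codes(flat: List[int]) -> List[List[int]]:
    """Regroup a flat token stream into per-level code lists."""
    if len(flat) % FRAME_SIZE != 0:
        raise ValueError("stream length must be a multiple of the frame size")
    levels: List[List[int]] = [[] for _ in TOKENS_PER_LEVEL]
    for start in range(0, len(flat), FRAME_SIZE):
        pos = start
        for level, per_frame in zip(levels, TOKENS_PER_LEVEL):
            level.extend(flat[pos:pos + per_frame])
            pos += per_frame
    return levels


if __name__ == "__main__":
    codes = [[10, 11], [20, 21, 22, 23], [30, 31, 32, 33, 34, 35, 36, 37]]
    stream = flatten_codes(codes)          # what the LLM would be trained to emit
    assert unflatten_codes(stream) == codes
```

After regrouping, each level would be turned into a tensor and handed to the codec's decoder to get the waveform back; as far as I recall, the SNAC README does this with `model.decode(codes)`, but check that against the version you install.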

MonolithFoundation commented 2 months ago

Thanks for the hint. How about Chinese?

MrWaterZhou commented 6 days ago

I tried, and it works well :)

MonolithFoundation commented 6 days ago

Woo, does it support Mandarin and Japanese?

MrWaterZhou commented 6 days ago

> Woo, does it support Mandarin and Japanese?

Not yet, but we are working on Mandarin and will release it soon.