KoljaB / RealtimeTTS

Converts text to speech in realtime
2.01k stars 201 forks source link

is styleTTS supported #11

Open itsPreto opened 11 months ago

itsPreto commented 11 months ago

https://github.com/yl4579/StyleTTS2 seems very promising, was wondering if its supported by this library

KoljaB commented 11 months ago

The approach looks good.

Must haves for a TTS to be included in RealtimeTTS:

Nice to haves:

They don't offer pip install currently, and I don't want losing this feature for RealtimeTTS and have people deal with stuff like git clone if possible. Would not be that hard to implement though looking at their requirements. I also need to check performance and streaming inference, currently I don't know if it's performant enough for realtime.

Anon2578 commented 11 months ago

Yes adding Styletts will bring more better quality to open source tts

KoljaB commented 11 months ago

Ok, persuaded. Just tried the model and yep, speed and quality are really good. I'll integrate this as next engine soon.

itsPreto commented 11 months ago

Ok, persuaded. Just tried the model and yep, speed and quality are really good. I'll integrate this as next engine soon.

How much work do you think it would be to support Silicon Macs? Most come with at least 16GB of unified RAM.

KoljaB commented 11 months ago

If the engine does not natively support Mac, chances are near zero I can do anything. Don't have one, can't test nothing.

To styletts I picked up so far:

Languages and platforms may be bit of a bummer, need some more answers / research.

aehlke commented 11 months ago

https://github.com/codeamt/PyTorchSwift