met4citizen / TalkingHead

Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.
MIT License
306 stars 98 forks source link

can i use msedge tts for lip sync? #64

Open 24wemy opened 3 days ago

met4citizen commented 2 days ago

I'm not sure. In addition to audio, the TalkingHead class requires word-level timestamps for accurate lip-sync and the MSEdge TTS appears to be a very limited version of Microsoft Azure TTS. However, it does include a wordBoundaryEnabled option in its request headers. If setting that flag to true works, and the server actually delivers word-level timestamps, then you might be able to make it work.

24wemy commented 2 days ago

I'm looking for a free TTS, because I don't have access to Google TTS or Microsoft Azure, can you help me?

met4citizen commented 1 day ago

The TTS you suggested doesn't seem to have any official API documentation, which implies that it is intended for Microsoft's internal use only. Moreover, the projects that use it do so by pretending to be Edge browsers, which most likely violates Microsoft's API policies. -- If you can find some legal document stating that their API can be used by anyone for free, I'm happy to help. Otherwise, no.