huggingface / parler-tts

Inference and training library for high-quality TTS models.
Apache License 2.0
2.6k stars, 265 forks

Looking for a way to combine spoken words with timestamps in output dictionary #32

Open bartekupartek opened 3 weeks ago

bartekupartek commented 3 weeks ago

Would it be possible to pair the spoken words with timestamps, and perhaps optionally return a dict containing the audio tensor and a word-to-timestamp mapping for the transcription?
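
To make the request concrete, here is a minimal sketch of what such a return value might look like. The function `build_tts_output`, the dictionary keys, and the `word_spans` input are all hypothetical and not part of parler-tts; the word boundaries are assumed to come from some separate alignment step that is not shown here, and a plain list stands in for the audio tensor.

```python
from typing import Any, Dict, Sequence, Tuple


def build_tts_output(
    audio: Sequence[float],
    sampling_rate: int,
    word_spans: Sequence[Tuple[str, int, int]],
) -> Dict[str, Any]:
    """Bundle audio with per-word timestamps (hypothetical output shape).

    word_spans holds (word, start_sample, end_sample) triples; how they
    are obtained is outside the scope of this sketch.
    """
    words = []
    for word, start_sample, end_sample in word_spans:
        # Reject spans that fall outside the audio or run backwards.
        if not 0 <= start_sample <= end_sample <= len(audio):
            raise ValueError(f"invalid span for {word!r}")
        words.append(
            {
                "word": word,
                # Convert sample indices to seconds.
                "start": start_sample / sampling_rate,
                "end": end_sample / sampling_rate,
            }
        )
    return {"audio": audio, "sampling_rate": sampling_rate, "words": words}


# Example: one second of silence at 16 kHz, split between two words.
audio = [0.0] * 16000
result = build_tts_output(
    audio, 16000, [("hello", 0, 8000), ("world", 8000, 16000)]
)
```

With a shape like this, `result["words"]` would give the start and end time of each word in seconds, alongside the audio under `result["audio"]`.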