linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence
GNU Affero General Public License v3.0
2.01k stars 156 forks source link

Please consider creating a node version #138

Closed 55Cancri closed 8 months ago

55Cancri commented 11 months ago

A node version will give access to a whole new segment of developers and allow word-level timestamps to be produced directly inside of a node lambda after generating audio with openais recent tts api. By having this in javascript, it will also make it easier for js devs to create audio-text synchronization or "karaoke-style" highlighting in their user interfaces when reading text.

iSuslov commented 11 months ago

As a workaround you can spawn a nodejs child process to call whisper_timestamped command