kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.
http://kaldi-asr.org

Sync audio with text? #4807

Open Sadk1999 opened 1 year ago

Sadk1999 commented 1 year ago

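In the audio, the speaker sometimes pauses and then continues speaking. How can the pause be represented in the text, for example by inserting a space?

Sometimes a word is also drawn out, as in "oooooooooh my god". How can the repeated letter be reflected in the text?

More details: https://www.b4x.com/android/forum/threads/synchronize-text-and-audio-together.143692/#post-912007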

jtrmal commented 1 year ago

Most probably not. It generally depends on how the acoustic model was trained and what units it uses.

y.


Sadk1999 commented 1 year ago

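> Most probably not. It generally depends on how the acoustic model was trained and what units it uses.

Use Voice Activity Detection (VAD) to find where the speech stops, and append a space to the text at those points.

Wiki: https://en.m.wikipedia.org/wiki/Voice_activity_detection
Video (VAD): https://youtu.be/lQlR01AVkPo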
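
To make the VAD idea concrete, here is a minimal sketch of a simple energy-based detector in plain Python. It is not Kaldi code and not necessarily what the linked video does: the function names (`frame_energies`, `find_pauses`, `tone`), the 20 ms frame size, the energy threshold, and the 0.3 s minimum pause are all assumptions chosen for illustration. It only reports where the pauses are; mapping them to positions in the text would still need word-level timings.

```python
import math

def frame_energies(samples, frame_len):
    """Mean squared amplitude of each non-overlapping frame."""
    return [
        sum(s * s for s in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]

def find_pauses(samples, sample_rate, frame_ms=20, threshold=1e-4, min_pause_s=0.3):
    """Return (start_s, end_s) spans whose frame energy stays below threshold.

    All default values are illustrative assumptions, not tuned settings.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    frame_s = frame_len / sample_rate
    energies = frame_energies(samples, frame_len)
    pauses, run_start = [], None
    # A loud sentinel frame at the end closes any silent run still open.
    for idx, energy in enumerate(energies + [float("inf")]):
        if energy < threshold:
            if run_start is None:
                run_start = idx
        elif run_start is not None:
            if (idx - run_start) * frame_s >= min_pause_s:
                pauses.append((run_start * frame_s, idx * frame_s))
            run_start = None
    return pauses

def tone(seconds, sample_rate, freq=220.0, amp=0.5):
    """Synthetic stand-in for speech: a plain sine wave."""
    n = int(seconds * sample_rate)
    return [amp * math.sin(2 * math.pi * freq * t / sample_rate) for t in range(n)]

# Synthetic test signal: 1 s of "speech", 0.5 s of silence, 1 s of "speech".
sample_rate = 8000
signal = tone(1.0, sample_rate) + [0.0] * int(0.5 * sample_rate) + tone(1.0, sample_rate)
pauses = find_pauses(signal, sample_rate)
print(pauses)
```

On the synthetic signal this reports a single pause covering roughly 1.0 s to 1.5 s. Real recordings have background noise, so a fixed threshold like this would need tuning or a more robust detector.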

jtrmal commented 1 year ago

I'm sorry, I'm not sure what you are saying or asking. y.
