Closed: boxabirds closed this issue 1 year ago
If it helps, I have (correct) output from what I think was v1.7.2. Not sure if you have regression tests in place to check?
Update: adding vad=True fixed the issue of the first timestamp being zero, but the other settings made no difference. I'll keep investigating.
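For anyone trying the same workaround, here is a minimal sketch of a call with vad=True. It assumes the whisper_timestamped Python package is installed; the helper name transcribe_with_vad, the "small" model choice, and the file path in the usage comment are illustrative assumptions, not settings confirmed in this thread.

```python
def transcribe_with_vad(audio_path, model_name="small", device="cpu"):
    """Transcribe a file with voice activity detection enabled (vad=True).

    Hypothetical helper for illustration; model_name and device are assumptions.
    """
    # Imported lazily so this snippet can be loaded without the package installed.
    import whisper_timestamped as whisper

    audio = whisper.load_audio(audio_path)
    model = whisper.load_model(model_name, device=device)
    # vad=True runs voice activity detection on the audio before transcription.
    return whisper.transcribe(model, audio, vad=True)


# Usage (placeholder path; downloads the model on first run):
# result = transcribe_with_vad("seaside-clip-long.mp3")
# print(result["segments"][0]["start"])
```

The result is a dictionary with a "segments" list, so the first reported start time can be read from result["segments"][0]["start"].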
Thank you for reporting. There are regression tests in place, but it's hard to guarantee that results won't degrade on particular cases.
I fixed a faulty heuristic that was causing the trouble you experienced when there is music before speech. It should be better now.
Amazing, thank you so much. It’s such a great tool.
On Tue, 9 May 2023 at 10:36, Jérôme Louradour wrote:
Closed #91 (https://github.com/linto-ai/whisper-timestamped/issues/91) as completed via 863d56d (https://github.com/linto-ai/whisper-timestamped/commit/863d56d0d1dfe779ca2dd73f4db2df2f48e6108e).
Hi, attached is an mp3 in which the first line of the verse starts at 8s and the second at 16s. But it's not being reported as such; in particular, start is always zero (but should be closer to 8s).
= Expected =
= Observed =
= To reproduce =
of note
seaside-clip-long.mp3-small.json.zip
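The symptom described above (a first start of zero where roughly 8s is expected) can be checked mechanically against a saved JSON result. The sketch below assumes the output has a top-level "segments" list whose items carry "start" and "end" in seconds; the function names, the 1-second tolerance, and the sample values are hypothetical and are not taken from the attached file.

```python
def first_start(result):
    """Return the earliest segment start time in seconds, or None if empty."""
    segments = result.get("segments", [])
    if not segments:
        return None
    return min(seg["start"] for seg in segments)


def starts_too_early(result, expected_start, tolerance=1.0):
    """True if the first reported start precedes the expected one by more than tolerance."""
    start = first_start(result)
    return start is not None and start < expected_start - tolerance


# Hypothetical stand-ins for the observed and expected outputs.
observed = {"segments": [
    {"start": 0.0, "end": 12.1, "text": "first line"},
    {"start": 16.2, "end": 20.0, "text": "second line"},
]}
expected = {"segments": [
    {"start": 8.1, "end": 12.1, "text": "first line"},
    {"start": 16.2, "end": 20.0, "text": "second line"},
]}

print(starts_too_early(observed, expected_start=8.0))
print(starts_too_early(expected, expected_start=8.0))
```

To check a real file, load it with json.load and pass the resulting dictionary in place of the sample data.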