Open bwagner opened 4 years ago
I think this is just some variance in playback positioning, especially when using MP3 files. For the next transcription, I'm going to try using WAV files for both generation and playback to see if this makes a difference.
When using an offset of 27.664 for episode 303 (Scott Morgan (Loscil)):
At the beginning, text and audio are perfectly in sync, but the audio gradually slips ahead, e.g. the monologue
should start at 410.48 according to the JSON, but in fact already starts around 409.46.
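
To put a number on the slip, here is a rough sketch that turns the two timestamps above into a drift rate. It assumes the drift grows linearly from zero at the start, and that both values are measured from the same origin (i.e. the 27.664 offset is already accounted for). The function names `drift_rate` and `corrected_time` are made up for illustration and are not part of the project.

```python
def drift_rate(expected_s: float, observed_s: float) -> float:
    """Relative drift per second of transcript time.

    Negative means the audio arrives earlier than the JSON timestamp says.
    Assumes zero drift at t=0 and linear growth afterwards (an assumption).
    """
    return (observed_s - expected_s) / expected_s


def corrected_time(json_time_s: float, rate: float) -> float:
    """Scale a JSON timestamp by the estimated linear drift."""
    return json_time_s * (1.0 + rate)


expected = 410.48  # monologue start according to the JSON
observed = 409.46  # where the monologue is actually heard

rate = drift_rate(expected, observed)
print(f"slip at monologue: {observed - expected:+.2f} s")
print(f"drift rate: {rate * 100:+.3f} %")
```

If the slip really is linear, applying `corrected_time` to every timestamp should bring later passages back in line. If it is not (for example, jumps caused by MP3 seeking), a single rate will not fix it.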
I don't understand why this is happening, as e.g. the text keeps perfectly in sync for