lowerquality / gentle

gentle forced aligner
https://lowerquality.com/gentle/
MIT License
1.45k stars 295 forks source link

mp3 file of the same song returns different alignment results #276

Open casillano opened 4 years ago

casillano commented 4 years ago

Tested on https://www.youtube.com/watch?v=B8WjnoZsXXM

I am using two methods to convert youtube videos to mp3 files: the first is https://ytmp3.cc/en13/ and the second is https://www.npmjs.com/package/youtube-dl. When I convert a video using the first method and pass the audio into the aligner, it works as expected. But using the second method, the aligner is not as accurate. The transcript used is the same for both methods.

I have also noticed that the file sizes for the two mp3 files differ (the mp3 from the first method is larger than the mp3 from the second method.

Note that for the first method, I am downloading the mp3 onto windows while on the second method I am downloading to linux.

natelawrence commented 4 years ago

In my experience, you can submit the exact same input media with identical transcript multiple times and get different results each time.

casillano commented 4 years ago

Right, they may vary a bit. But from what I've seen, converting and downloading videos from the site results in consistently better alignment than using youtube-dl.