readbeyond / aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
http://www.readbeyond.it/aeneas/
GNU Affero General Public License v3.0

Can't reproduce example "Fairest Creatures" alignment #284

Closed by tom-huntington 2 years ago

tom-huntington commented 2 years ago

On Windows I got the correct alignment, but not on WSL. I'll try to solve this myself; I'm just posting in case it helps someone else.

python -m aeneas.tools.execute_task test.mp3 test.txt "task_language=eng|os_task_file_format=txt|is_text_type=plain|" -r="tts=espeak" map.txt

f000001 0.000 6.960 "From fairest creatures we desire increase,"
f000002 6.960 11.320 "That thereby beauty’s rose might never die,"
f000003 11.320 16.040 "But as the riper should by time decease,"
f000004 16.040 19.680 "His tender heir might bear his memory:"
f000005 19.680 24.240 "But thou contracted to thine own bright eyes,"
f000006 24.240 31.520 "Feed’st thy light’s flame with self-substantial fuel,"
f000007 31.520 35.280 "Making a famine where abundance lies,"
f000008 35.280 41.440 "Thy self thy foe, to thy sweet self too cruel:"
f000009 41.440 47.040 "Thou that art now the world’s fresh ornament,"
f000010 47.040 51.400 "And only herald to the gaudy spring,"
f000011 51.400 53.200 "Within thine own bud buriest thy content,"
f000012 53.200 53.200 "And tender churl mak’st waste in niggarding:"
f000013 53.200 53.200 "Pity the world, or else this glutton be,"
f000014 53.200 53.240 "To eat the world’s due, by the grave and thee."

Audio produced by espeak: https://user-images.githubusercontent.com/55266932/170434589-970b0d03-1a25-4f3a-91fd-b615e0cc8e7a.mp4
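
The bad WSL result shows up as fragments with zero or near-zero duration at the end of the map (f000012 to f000014 in the output above). A minimal sketch for flagging such fragments in a txt sync map follows. The function names and the 0.1 second threshold are illustrative assumptions, not part of aeneas.

```python
import re

# One fragment per line: id, begin, end, quoted text (the txt format shown above).
LINE_RE = re.compile(r'^(\S+)\s+([0-9.]+)\s+([0-9.]+)\s+"(.*)"\s*$')


def parse_sync_map(text):
    """Return a list of (fragment_id, begin, end, text) tuples."""
    fragments = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            frag_id, begin, end, frag_text = match.groups()
            fragments.append((frag_id, float(begin), float(end), frag_text))
    return fragments


def collapsed_fragments(fragments, min_duration=0.1):
    """Return ids of fragments shorter than min_duration seconds.

    min_duration is an arbitrary threshold chosen for illustration.
    """
    return [fid for fid, begin, end, _ in fragments if end - begin < min_duration]


if __name__ == "__main__":
    sample = "\n".join([
        'f000011 51.400 53.200 "Within thine own bud buriest thy content,"',
        'f000012 53.200 53.200 "And tender churl mak\u2019st waste in niggarding:"',
        'f000013 53.200 53.200 "Pity the world, or else this glutton be,"',
        'f000014 53.200 53.240 "To eat the world\u2019s due, by the grave and thee."',
    ])
    print(collapsed_fragments(parse_sync_map(sample)))
```

Running this on the contents of map.txt gives a quick check of whether an alignment has collapsed like the one above.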

tom-huntington commented 2 years ago

I followed the install-from-source instructions, and that build produced different TTS audio, which gave the correct alignment:

https://user-images.githubusercontent.com/55266932/170440392-1b36e633-637c-4553-96fb-ef20f5c2cc09.mp4

f000001 0.000 5.880 "From fairest creatures we desire increase,"
f000002 5.880 9.240 "That thereby beauty’s rose might never die,"
f000003 9.240 11.920 "But as the riper should by time decease,"
f000004 11.920 15.280 "His tender heir might bear his memory:"
f000005 15.280 18.880 "But thou contracted to thine own bright eyes,"
f000006 18.880 22.760 "Feed’st thy light’s flame with self-substantial fuel,"
f000007 22.760 25.680 "Making a famine where abundance lies,"
f000008 25.680 31.240 "Thy self thy foe, to thy sweet self too cruel:"
f000009 31.240 34.440 "Thou that art now the world’s fresh ornament,"
f000010 34.440 36.960 "And only herald to the gaudy spring,"
f000011 36.960 40.680 "Within thine own bud buriest thy content,"
f000012 40.680 44.480 "And tender churl mak’st waste in niggarding:"
f000013 44.480 48.080 "Pity the world, or else this glutton be,"
f000014 48.080 53.240 "To eat the world’s due, by the grave and thee."

tom-huntington commented 2 years ago

The same problem came back later, and I had to re-run

python setup.py build_ext --inplace

to fix it.
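
Since rebuilding the extensions fixed it, one plausible reading is that the compiled C extensions were missing or stale. The thread does not confirm this. Below is a rough sketch for checking whether the extension modules import at all. The module names in ASSUMED_EXTENSIONS are an assumption about the aeneas source layout, and missing_modules is a hypothetical helper, not an aeneas API.

```python
import importlib

# Assumed names of the compiled aeneas extensions; adjust if your tree differs.
ASSUMED_EXTENSIONS = ("aeneas.cdtw.cdtw", "aeneas.cmfcc.cmfcc", "aeneas.cew.cew")


def missing_modules(names):
    """Return the subset of module names that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            # Covers ModuleNotFoundError and failures to load a compiled module.
            missing.append(name)
    return missing


if __name__ == "__main__":
    not_found = missing_modules(ASSUMED_EXTENSIONS)
    if not_found:
        print("Not importable:", ", ".join(not_found))
        print("Try: python setup.py build_ext --inplace")
    else:
        print("All assumed extensions import.")
```

A successful import only shows that a module loads, not that it is up to date, so re-running build_ext is still the safer fix. aeneas also ships its own check, python -m aeneas.diagnostics.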