readbeyond / aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
http://www.readbeyond.it/aeneas/
GNU Affero General Public License v3.0

how to ignore texts in square brackets please? #239

Closed: liujinliang01 closed this issue 4 years ago

liujinliang01 commented 4 years ago

Hello,

I can see that there is an option named "is_text_file_ignore_regex", but I cannot find any examples as to how to use it.

Specifically, I need to tell aeneas to ignore texts between square brackets like this:

[pause]
[Woman]: Hi Tony, good to see you again.
[Man]: Hi Liz, good to see you too.
[some Chinese translation]

Could anybody please help?

Best regards,
Jim

liujinliang01 commented 4 years ago

Oh, luckily I have found the syntax for ignoring text from the fantastic online aeneas tool at www.aeneasweb.org.

Just in case anybody else has the same request as mine, here is the syntax for telling aeneas to ignore texts between square brackets:

is_text_file_ignore_regex=\[.*?\]
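To see what that pattern actually matches, here is a small sketch using Python's standard `re` module. It only illustrates the regex itself; it is not how aeneas processes the text internally, and the helper name `strip_ignored` and the sample line are made up for this example.

```python
import re

# The pattern from the setting above: a literal "[", then as few
# characters as possible (non-greedy ".*?"), then a literal "]".
IGNORE_REGEX = r"\[.*?\]"

def strip_ignored(line: str, pattern: str = IGNORE_REGEX) -> str:
    """Illustration only: blank out every match, then tidy whitespace."""
    without_matches = re.sub(pattern, " ", line)
    return " ".join(without_matches.split())

sample = "[pause] [Woman]: Hi Tony, good to see you again. [some Chinese translation]"

# Non-greedy: each bracketed span is matched separately.
cleaned = strip_ignored(sample)
print(cleaned)  # ": Hi Tony, good to see you again."

# Greedy ".*" would instead match from the first "[" to the last "]",
# swallowing the spoken text as well.
greedy = strip_ignored(sample, r"\[.*\]")
print(repr(greedy))  # ''
```

Two things worth noticing: the `?` after `.*` matters, because the greedy form removes everything between the first and the last bracket on a line, and the colon after a speaker tag such as `[Woman]:` sits outside the brackets, so the pattern does not match it.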

It works like a charm! Thanks to the great creator(s) for this wonderful tool.

Best regards,
Jim