in addition to formal lines, add optional instances of "filler" text, to better guide the user when recording timestamps. For example, suppose between lines X and X+1 there are lyrics that will be sung but you might not want formal subtitles for them. Such lyrics would be called "filler", and are NOT part of any textHeard. (259600d ✅)
if possible, give the user the choice of highlighting words either by whole word (current implementation) or by syllable.
textHeard
. (259600d ✅)