Open IMBAepsilon opened 2 months ago
Thanks for the report.
The align-transcript-and-translation
is a complex operation that combines alignment engines and a special word embedding model.
Due to how the text is tokenized when passed to the embedding model, it's possible that there are various edge cases where the tokenization and de-tokenization fails to match the original text.
I'll need the exact inputs used so I can reproduce the error and determine how to fix it.
when I use
I got