Verbatim SubER - Githubissues

Yes, I also already thought about this, see last line in the README :) I actually had a somewhat hacky way of getting the alignment out of the TER algorithm during initial development. I would need to do a clean version of that. A first step would be to just get the number of deletions, insertions and substitutions. (Probably separately for word and break tokens). The other option would be to really give detailed Levenshtein alignment information per sentence. The original TER tool has that. What did you have in mind? Both, I guess? :) The "problem" is that I use the sacrebleu implementation of TER. It does not provide this information and I want to avoid altering it too much because I treat it as a reference implementation. But I can try to come up with a compromise. 😉

(Sorry, late reply due to vacation.)

apptek / SubER

Verbatim SubER #3