IRT-Open-Source / scf

Subtitling Conversion Framework
Apache License 2.0

STLXML2STL: composite sequences not correctly mapped #55

spoeschel closed 4 years ago

spoeschel commented 4 years ago

Composite sequences (a base character plus a combining diacritical mark) are not mapped correctly from STLXML to STL. Affected sequences include, for example:

J́
j́
J̃
L̃
M̃
R̃
j̃
l̃
m̃
r̃
E̊
e̊

The cause is the different position of the combining diacritical mark: in Unicode it follows the base character (suffix), while in EBU STL it precedes the base character (prefix). The order of the two characters therefore has to be swapped when such a sequence is converted.
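
A minimal sketch of the reordering step, for illustration only. It is not the actual STLXML2STL code; the function name `swap_combining_order` is hypothetical, and it assumes the input is a Python string in which every combining mark directly follows its base character. It only reorders Unicode code points; mapping the marks and base characters to the corresponding STL byte values would be a separate step that is not shown here.

```python
import unicodedata


def swap_combining_order(text: str) -> str:
    """Move combining marks in front of the base character they follow.

    Hypothetical helper: Unicode order is base + mark(s), the prefix
    order described above is mark(s) + base.
    """
    result = []
    base = None   # base character currently being collected
    marks = []    # combining marks that followed that base

    for ch in text:
        # unicodedata.combining() returns a non-zero class for combining marks
        if unicodedata.combining(ch) and base is not None:
            marks.append(ch)
        else:
            # A new base starts: flush the previous one, marks first
            if base is not None:
                result.extend(marks)
                result.append(base)
            base = ch
            marks = []

    # Flush the last collected base character
    if base is not None:
        result.extend(marks)
        result.append(base)

    return "".join(result)
```

For example, `swap_combining_order("J\u0301")` returns `"\u0301J"`, while text without combining marks is returned unchanged.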