SubtitleEdit / subtitleedit

the subtitle editor :)
http://www.nikse.dk/SubtitleEdit/Help
GNU General Public License v3.0
8.95k stars 918 forks source link

OCR word list won't recognize words starting with - #4310

Closed t2YU2m8l83 closed 4 years ago

t2YU2m8l83 commented 4 years ago

OCR of anime subtitles includes sometimes name-honorific terms eg. Sakura-chan. Adding "-chan", "-san" and "-kun" to the names/noise list will still report "chan", "san" and "kun" als unknown words for OCR. simply adding such generic 3/4 letter terms to the list would be possible but could result in false positives at other places.

niksedk commented 4 years ago

Thx for the info/idea :) Beta updated: https://github.com/SubtitleEdit/subtitleedit/releases/download/3.5.16/SubtitleEditBeta.zip

t2YU2m8l83 commented 4 years ago

I updated to 3.5.17 but it still reports "kun" and co as unknown despite "-kun" listed in the "..._names_user.xml" file. Below you can find a subtitle which contains -kun, -san, etc. First "-kun" occurs at line 16.

https://mega.nz/file/1JAgyTqa#SlZF9wSYnbHb8qw5hMGoff03OUq7bf1PPA0XSX0EoNc