Open norikawa opened 6 days ago
I've also gone through and confirmed that the incorrect characters only appear in this particular circumstance, so this method of replacing every instance of the given character/sequence will never replace good Japanese text with bad replacement text.
I haven't confirmed that this list covers all bad conversions though, I doubt I found all of them!
There is a translation table in some of the js files, so I think we'd just need to add some new characters to the list? Will check.
Some song names (and possibly other details such as artist names, though I've mostly only seen song names for this) are incorrect when converted to UTF-8 from SHIFT-JIS. For example, "Entropic EnĤαncemEnt" is instead converted to "Entropic En鹹αncemEnt"
For my own WIP server in Crystal, I've gone through and found most of the incorrect characters and made a list of them to automatically replace them after the conversion step:
I'm not familiar enough with TypeScript to know what the equivalent functions would be, but hopefully the list of replacements can be helpful at least!