Closed noellabo closed 5 years ago
Older Japanese websites are encoded with Shift_JIS or EUC-JP instead of UTF-8 for historical reasons.
What's worse, often the web server doesn't return encoding information in HTTP response.
Therefore, you need to get the encoding from the HTML content's META tag.
This improvement may be useful for other language encodings with similar issues.
Great! Thanks
Older Japanese websites are encoded with Shift_JIS or EUC-JP instead of UTF-8 for historical reasons.
What's worse, often the web server doesn't return encoding information in HTTP response.
Therefore, you need to get the encoding from the HTML content's META tag.
This improvement may be useful for other language encodings with similar issues.