t-sin / inquisitor

Encoding/end-of-line detection and external-format abstraction for Common Lisp

Is UTF16 equal to UCS2? #51

Closed: t-sin closed this issue 6 years ago

t-sin commented 7 years ago

Confirmation required.

from https://github.com/t-sin/inquisitor/pull/50/files#diff-8b59a91a78709dc487f614e8f8f07c29R29

t-sin commented 7 years ago

If these must be distinguished, UCS-2 needs to be added to the name mapping.

t-sin commented 6 years ago

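UCS-2 covers only the characters on the Basic Multilingual Plane (BMP) of Unicode. UTF-16 encodes those characters identically and uses surrogate pairs for characters outside the BMP.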

In the name map above, all of the implementation-dependent names mean Unicode. Nowadays, characters outside the BMP (e.g. emoji) are widely used, so I think "Unicode" with 16-bit code units will mean UTF-16, not UCS-2, from now on.

Because of this, I decided to rename :ucs-2le/be to :utf-16le/be.
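
To make the UCS-2 / UTF-16 difference discussed above concrete, here is a minimal Python sketch. It is not part of inquisitor; the helper names `utf16_code_units` and `fits_in_ucs2` are hypothetical and exist only for this illustration. It shows that a BMP character takes one 16-bit code unit in either encoding, while an emoji needs a surrogate pair and so cannot be represented in UCS-2.

```python
import struct


def utf16_code_units(text):
    """Return the UTF-16 code units of `text` as a list of integers.

    Hypothetical helper for illustration only.
    """
    data = text.encode("utf-16-le")  # little-endian, no BOM
    return list(struct.unpack("<%dH" % (len(data) // 2), data))


def fits_in_ucs2(text):
    """Return True if every character of `text` is on the BMP.

    Hypothetical helper: UCS-2 can only hold code points up to U+FFFF.
    """
    return all(ord(ch) <= 0xFFFF for ch in text)


bmp_char = "\u3042"    # HIRAGANA LETTER A, on the BMP
emoji = "\U0001F600"   # GRINNING FACE, outside the BMP

# One code unit for the BMP character, two (a surrogate pair) for the emoji.
print([hex(u) for u in utf16_code_units(bmp_char)])
print([hex(u) for u in utf16_code_units(emoji)])
print(fits_in_ucs2(bmp_char), fits_in_ucs2(emoji))
```

For text that stays on the BMP the two encodings produce the same bytes, which is why one name can stand in for the other until a character such as an emoji appears.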

t-sin commented 6 years ago

Renamed in https://github.com/t-sin/inquisitor/commit/524ec5bf8a62063bcb62cc5613551c4ec65816de, so I am closing this issue.