Character class narrow unicode

I was seeing some interesting behavior when python2 had only unicode ucs2 support:

$ python
Python 2.7.18 (default, Sep  1 2020, 16:08:16)
>>> s = u'\uD859\uDFCC'
>>> s
u'\U000267cc'

u'\uD859\uDFCC'.encode("UTF-32").decode("UTF-32")
u'\U000267cc'

It was taking the utf-16 hex codes (\uD859 and \uDFCC) and converting them to the utf-32 hex code (\U000267cc) behind the scenes. I have methods like repr_string and repr_bytes and I might want to add some utf-8 (bytes), utf-16 (the \u values) and utf-32 (the \U values) methods just so you can get more information about the character. To see how all these come together, you can use fileformat.info and these are some pages I had open:

Unicode encoding : utf-8 , utf-16 , utf-32
How to convert from utf-16 to utf-32 on Linux with std library?

search:

python utf16 to utf32
convert utf16 to utf32

Jaymon / datatypes

Character class narrow unicode #10