adrian-thurston / colm

The Colm Programming Language
MIT License
166 stars 32 forks source link

[colm] work with unsigned chars #104

Closed adrian-thurston closed 4 years ago

adrian-thurston commented 4 years ago

Currently using a bare "char" as the alphabet type. We assume this is signed, which itself is a problem on some architectures. Really we should be going to unsigned, so we can reference chars with the high bit set in the way that is common when discussing utf and unicode: using hex chars. Negative values are really uncomfortable and don't lend themselves to specifying ranges.

More information in: #97 #81