The Unicode Ethereum Project is an initiative to make it easier to work with unicode and string data within Solidity. Hope this helps out developers in the future!
From the project README,
Unicode data and algorithms are essential to any major programming language. Solidity, like many lower-level programming languages, represent strings as a UTF-8 encoded bytes and does not natively support character-based operations like length, charAt, or isLowercase. There are popular third-party libraries for string manipulation, like https://github.com/Arachnid/solidity-stringutils, but none that provide information about the underlying Unicode characters. If you are building an application or contract that receives user input as strings, understanding user input is critical for any validation, sanitization, or standardization logic.
Checklist
[x] The URL is not already present in the list (check with CTRL/CMD+F in the raw markdown file).
[x] Each description starts with an uppercase character and ends with a period. Example: solc-js - JavaScript bindings for the compiler.
[x] Drop all A / An prefixes at the start of the description.
[x] Avoid using the word Solidity in the description.
Hi š
The Unicode Ethereum Project is an initiative to make it easier to work with unicode and string data within Solidity. Hope this helps out developers in the future!
From the project README,
Checklist
Example:
solc-js - JavaScript bindings for the compiler.
A
/An
prefixes at the start of the description.Solidity
in the description.