hexawyz / NetUnicodeInfo

Unicode Character Inspector & Library providing a subset of the Unicode data for .NET clients.
https://www.nuget.org/packages/UnicodeInformation/
MIT License
59 stars 11 forks source link

Feature Request: Unicode Script property #11

Open russcam opened 4 months ago

russcam commented 4 months ago

It can be useful to know the Unicode Script property of characters, for example, in language detection.

Would you be interested in a PR to add support for Unicode scripts, and to look up the script name for a given character?

Dubzer commented 4 months ago

I'd like to see that, too πŸ‘€

russcam commented 2 months ago

@Dubzer in the meantime, you may be interested in using UnicodeScript.g.cs, generated using https://github.com/searchpioneer/lingua-dotnet/tree/main/build/Lingua.UnicodeScriptGenerator. It implements an extension method GetScript(this char ch) that returns the unicode script for the char.

hexawyz commented 2 months ago

Hello

Thanks for providing this option 😊

I do plan to address this in the library at some point, but let's be honest, this project is not really the top of my priorities at the moment. πŸ˜Άβ€πŸŒ«οΈ This change and the other one required to upgrade the Unicode version seem to require slightly non-trivial changes, so I need to reserve a chunk of time to work on this at some point. It will happen, but I can't promise when πŸ˜