Closed by rurban 7 years ago
The remaining question is whether certain languages need aliases for sets of scripts, because they use multiple scripts by default: for example Japanese for Hiragana and Katakana (what about Kanji, i.e. Han?), and Korean for Hangul and Han (Chinese).
To avoid the confusable-character security attacks described in TR39, we add the following Unicode rules for identifiers and literals:
See http://www.unicode.org/reports/tr39/#Mixed_Script_Detection
This holds for all identifiers (all names: package, gv, sub, variable) and for numeric literals. The script name of a code point is returned by
Unicode::UCD::charscript($codepoint_as_uv)
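As a rough illustration of the mixed-script check, here is a minimal user-level sketch built on `Unicode::UCD::charscript`. The helper names `scripts_of` and `is_mixed_script` are hypothetical and not part of any core API, and skipping the `Common` and `Inherited` scripts is an assumption taken from the TR39 mixed-script detection section, not necessarily the exact rule implemented here.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charscript);

# Hypothetical helper: list the distinct scripts used by the characters of $name.
# Common and Inherited are skipped (assumption, following TR39), so digits,
# underscores and combining marks do not count as a separate script.
sub scripts_of {
    my ($name) = @_;
    my %seen;
    for my $ch (split //, $name) {
        my $script = charscript(ord $ch);
        next unless defined $script;
        next if $script eq 'Common' or $script eq 'Inherited';
        $seen{$script} = 1;
    }
    my @scripts = sort keys %seen;
    return @scripts;
}

# Hypothetical helper: true if $name draws on more than one script.
sub is_mixed_script {
    my ($name) = @_;
    my @scripts = scripts_of($name);
    return @scripts > 1 ? 1 : 0;
}

# "p\x{430}ypal" contains CYRILLIC SMALL LETTER A in place of a Latin "a".
for my $name ("paypal", "foo_1", "p\x{430}ypal") {
    my $verdict = is_mixed_script($name) ? "mixed" : "single script";
    printf "%d chars: %s (%s)\n", length $name, $verdict, join ",", scripts_of($name);
}
```

A per-language alias, as asked above for Japanese or Korean, could then be modelled as a set of allowed scripts that is checked against the result of `scripts_of`, instead of requiring exactly one script.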
Currently there are 131 scripts. The list can be extracted with:
perl -alne'/; (\w+) #/ && print $1' lib/unicore/Scripts.txt | sort -u > scripts.lst
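The same list can probably also be obtained without parsing `Scripts.txt`, via `Unicode::UCD::charscripts()`, which returns a hash reference keyed by script name. This is only a sketch: the count depends on the Unicode version bundled with the running perl, and it may differ slightly from the one-liner above, for instance if `Unknown` is included.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charscripts);

# charscripts() returns a hash ref: script name => list of code point ranges.
my $scripts = charscripts();
my @names   = sort keys %$scripts;

printf "%d scripts known to this perl\n", scalar @names;
print "$_\n" for @names;
```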