open-i18n / rust-unic

UNIC: Unicode and Internationalization Crates for Rust
https://crates.io/crates/unic
Other
234 stars 24 forks source link

Upgrade to Unicode 12.0 #260

Open eyeplum opened 5 years ago

eyeplum commented 5 years ago

Description

Update external data and all modules to Unicode 12.0. For changes in Unicode 12.0, see: https://www.unicode.org/versions/Unicode12.0.0/

According to Section M of the change note, migrating from Unicode 11.0 to 12.0 should be straight forward. For us, presumably it's basically updating all data files to Unicode 12.0 and regenerate all tables.

Blocked by #259 .

Unicode 12.1

It might be trivial to upgrade to Unicode 12.1 at the same time (assuming this issue will be implemented after 2019 May 7), as it only adds one character U+32FF SQUARE ERA NAME REIWA.

For details, see: https://unicode.org/versions/Unicode12.1.0/

data-man commented 4 years ago

Time for Unicode 13.0 :)

crlf0710 commented 1 year ago

Time for Unicode 15.0 :(

I'm... trying to help implementing https://github.com/rust-lang/rust/issues/101840 . Currently rustc is relying on unic_emoji_char::is_emoji for diagnostics. But it seems the Unicode data here is quite outdated...

eyeplum commented 1 year ago

Time for Unicode 15.0 :(

I'm... trying to help implementing rust-lang/rust#101840 . Currently rustc is relying on unic_emoji_char::is_emoji for diagnostics. But it seems the Unicode data here is quite outdated...

Hi there!

I have been using my own fork in recent years, the fork is currently updated to Unicode 14.0 (and will be updated to Unicode 15.0 soon).

I've been meaning to eventually merge those changes into here, perhaps now is a good time to give it a go :)

Cc: @behnam

eyeplum commented 1 year ago

My fork current has these changes:

If we want them to be merged here, I think we will need to decide a way to release the changes. Perhaps each Unicode version as a point releases? E.g.:

Though I'm not sure what's the best way to handle the new Unihan module... Perhaps we can just chuck it in one of those releases...