rogerbinns / apsw

Another Python SQLite wrapper
https://rogerbinns.github.io/apsw/
Other
739 stars 97 forks source link

Update to Unicode 16.0 when released September 2024 #514

Closed rogerbinns closed 1 month ago

rogerbinns commented 6 months ago

See https://blog.unicode.org/2024/05/unicode-technical-committee-utc-updates.html and https://www.unicode.org/versions/beta-16.0.0.html

Of note there are some tweaks to segmentation and line breaking that will need to be implemented, in addition to updating all the data tables.

rogerbinns commented 6 months ago

Changes to line breaking TR14 are documented after the big table at https://www.unicode.org/reports/tr14/tr14-52.html#History ("Revision 52") which is quite a few rule changes.

Changes to segmentation TR29 are documented at https://www.unicode.org/reports/tr29/tr29-44.html#Modifications and are minor.

rogerbinns commented 2 months ago

For TR29 some codepoints changed category, and new codepoints were added, but there are no changes to the rules and the test files are unchanged.

rogerbinns commented 1 month ago

This is now done with d83e7910a843a60b0e6cb28e0025c2c1a3306e20 being the last commit. The TR14 rules are contradictory and finicky to implement.