Open rmisev opened 2 months ago
@markusicu @macchiati could you provide some context for these changes? While we submitted a bunch of feedback (as recorded in #744) it seems there's quite a few other changes as well.
E.g., Ⅎ
is an invalid domain name today, but with Unicode 16 would be valid?
(We are seeing this in WebKit as well now: https://github.com/WebKit/WebKit/pull/37104.)
We considered several issues and made recommendations that the UTC approved.
For example, there were complicated and unnecessary differences in processing with UseSTD3ASCIIRules=true vs. false, and some characters were disallowed based on differences between IDNA2003 and IDNA2008, while (a) IDNA2003 has not been relevant in a long time and (b) transitional processing had been deprecated in Unicode 15.1.
For details see
What is the issue with the URL Standard?
Version 16.0.0 (2024-08-30) of Unicode Technical Standard #46 has been released. It fixes some previously reported issues:
So I think it's worth upgrading to that standard:
IdnaTestV2-parser.py
tool and theIdnaTestV2.json
test file. I have opened a PR for this: https://github.com/web-platform-tests/wpt/pull/48301