w3c / findtext

An API spec to define how to find text in a Web document, using basic information, and return DOM ranges
14 stars 7 forks source link

order of case fold and normalization operations [I18N-ISSUE-503] #12

Open aphillips opened 9 years ago

aphillips commented 9 years ago

http://www.w3.org/International/track/issues/503 [I18N-ISSUE-503]

http://www.w3.org/TR/2015/WD-findtext-20151015/#performing-a-search-operation

In the section "Search Algorithm" within this section, case fold mapping appears as step 11. Normalization appears as step 13.

Since normalization (particularly the compatibility mapping) may change the character sequence, it would be better to apply the case folding after the normalization, particularly if the case fold is language-sensitive.