unicode-rs / unicode-normalization

Unicode Normalization forms according to UAX#15 rules
https://unicode-rs.github.io/unicode-normalization

Add API to correct defective combining character sequences #98

Open Jules-Bertholet opened 7 months ago

Jules-Bertholet commented 7 months ago

My motivation here is that this API pairs well with unicode-width (or with any terminal that uses similar rules to unicode-width for determining character advance width, IOW ~all of them). Defective combining character sequences are one of the main cases that these width APIs don't handle properly, so correcting them beforehand is useful. (#97 has a similar motivation.)
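
To illustrate the kind of correction meant here, the following is a minimal sketch and not the API proposed in this PR. Both function names are hypothetical. It makes two simplifying assumptions: it only recognizes marks in the Combining Diacritical Marks block (U+0300..=U+036F), and it only handles a defective sequence at the very start of the string. The fix it applies, prepending U+00A0 NO-BREAK SPACE as a base character, follows the Unicode Standard's guidance that an isolated combining mark should be displayed as if it were applied to a no-break space.

```rust
/// Hypothetical helper: a deliberately simplified check that only covers
/// the Combining Diacritical Marks block (U+0300..=U+036F). A real
/// implementation would consult the full Unicode character database.
fn is_combining_mark_simplified(c: char) -> bool {
    ('\u{0300}'..='\u{036F}').contains(&c)
}

/// Hypothetical helper (not the API from this PR): if the string begins
/// with a combining mark, so that the mark has no base character, prepend
/// U+00A0 NO-BREAK SPACE to give it one. Strings that start with any other
/// character, and empty strings, are returned unchanged.
fn fix_leading_defective_sequence(s: &str) -> String {
    match s.chars().next() {
        Some(c) if is_combining_mark_simplified(c) => {
            // U+00A0 takes 2 bytes in UTF-8.
            let mut out = String::with_capacity(s.len() + 2);
            out.push('\u{00A0}');
            out.push_str(s);
            out
        }
        _ => s.to_owned(),
    }
}
```

With this sketch, `fix_leading_defective_sequence("\u{0301}abc")` returns `"\u{00A0}\u{0301}abc"`, so the acute accent has a base to attach to before the string is handed to a width calculation. Defective sequences can also occur mid-string (for example after a control character or line break), which this sketch does not cover.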

Manishearth commented 7 months ago

In that case please include some description of this API's use cases in its documentation.

I still don't know if or when I'll be able to review this and the other PRs.