Open MnO2 opened 5 years ago
Examples here: https://developer.apple.com/swift/blog/?id=30
"abcde\u{0301}\u{1100}\u{1161}\u{AC00}" should not be segmented as "abcde" and "\u{0301}\u{1100}\u{1161}\u{AC00}". "e\u{0301}" should be together.
https://unicode-rs.github.io/unicode-segmentation/unicode_segmentation/struct.Graphemes.html This could be considered. But only required if the behaviour of re_han results into incorrect segmentation by SplitMatch
Examples here: https://developer.apple.com/swift/blog/?id=30
"abcde\u{0301}\u{1100}\u{1161}\u{AC00}" should not be segmented as "abcde" and "\u{0301}\u{1100}\u{1161}\u{AC00}". "e\u{0301}" should be together.