skishore / makemeahanzi

Free, open-source Chinese character data
https://www.skishore.me/makemeahanzi/
Other
1.83k stars 465 forks source link

Why does 'matches' data often contain null values? #61

Open dhowe opened 5 years ago

dhowe commented 5 years ago

Can you explain what this means? I find around 173 cases in the dictionary where at least one value in the 'matches' field is null.

skishore commented 5 years ago

There are some characters like 必 where most strokes are parts of components, but one or two strokes are just extra. Since the extra strokes can't be assigned to a component in the character decomposition notation, their "match" is null.

I think this example might actually be written as an overlay, so all of its strokes do get matched, but others are like that. Could you note some examples here?

dhowe commented 5 years ago

One example: 齐