codegooglecom / cjklib

Automatically exported from code.google.com/p/cjklib

Invalid decomposition entries for 卪, 叉 and 丼. #26

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1. Setup:
from cjklib import characterlookup
cjk = characterlookup.CharacterLookup('C')

2. Break stuff:
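for char in [u'卪', u'叉', u'丼']:
    try:
        cjk.getStrokeOrder(char)
    except Exception:
        # The lookup fails for these characters; swallow the exception
        # so that only the UserWarning shows up in the output.
        pass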

What is the expected output? What do you see instead?
I get the following output (times three):
.../site-packages/cjklib/characterlookup.py:1204: UserWarning: Invalid decomposition entry [u'\u2ff4', (u'\u5369', 0), (u'\u4e36', 0)]
  "Invalid decomposition entry %r" % subTree)
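
To see which character triggers which warning, the warnings can be recorded per call instead of being read off stderr. The following is only a sketch using the standard `warnings` module; `collect_warnings` is a hypothetical helper name, not part of cjklib, and it assumes the warning is emitted through `warnings.warn`, as the `UserWarning` output above suggests.

```python
import warnings


def collect_warnings(func, *args):
    """Call func(*args) and return the text of every warning it emitted.

    Hypothetical helper, not part of cjklib. Exceptions raised by func
    are swallowed so that only the warnings are reported.
    """
    with warnings.catch_warnings(record=True) as caught:
        # "always" disables the once-per-location suppression, so a
        # warning repeated for several characters is recorded each time.
        warnings.simplefilter("always")
        try:
            func(*args)
        except Exception:
            pass
    return [str(w.message) for w in caught]
```

With the setup from step 1, `collect_warnings(cjk.getStrokeOrder, char)` could then be called once per character in the loop from step 2.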

I'm not sure what output to expect. There doesn't seem to be decomposition
data for these characters, but that is not the problem. The warning is caused
by inconsistent use of IDS (Ideographic Description Sequence) data.
According to characterlookup.py (line 1195):
# ⿴ should only occur for 囗
None of 卪, 叉 and 丼 satisfies this, which results in the warnings.
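
The escaped entry in the warning is easier to read once the code points are named. The sketch below uses only the standard `unicodedata` module; `describe` is a hypothetical helper, and reading the integer in each pair as a variant index is an assumption based on the shape of the entry, not something stated in this report.

```python
import unicodedata

# Entry copied verbatim from the warning: an IDS operator followed by
# (component, integer) pairs. Treating the integer as a variant index
# is an assumption.
entry = [u'\u2ff4', (u'\u5369', 0), (u'\u4e36', 0)]


def describe(entry):
    """Return one ASCII line per element: code point and Unicode name."""
    operator, components = entry[0], entry[1:]
    lines = ['U+%04X %s' % (ord(operator), unicodedata.name(operator))]
    for char, variant in components:
        lines.append('U+%04X %s (variant %d)'
                     % (ord(char), unicodedata.name(char), variant))
    return lines


for line in describe(entry):
    print(line)
```

The first line names U+2FF4 as IDEOGRAPHIC DESCRIPTION CHARACTER FULL SURROUND, i.e. the ⿴ operator that the quoted source comment reserves for 囗.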

What version of the product are you using? On what operating system?
0.3.2, installed with pip.
Python 2.7
OS X 10.6.8 Snow Leopard.

Please provide any additional information below.

Original issue reported on code.google.com by casperle...@gmail.com on 29 May 2014 at 3:40

GoogleCodeExporter commented 8 years ago
Oops. Obviously, there is decomposition data. There is no stroke order data 
though.

Original comment by casperle...@gmail.com on 29 May 2014 at 3:43