Closed by bitsgalore 10 years ago
Had another look at this: the actual parsing of the XML isn't the problem; the post-processing in etpatch.py/makeHumanReadable goes wrong in the convert step at the bottom.
What might work:
import unicodedata
def remove_control_characters(s): return "".join(ch for ch in s if unicodedata.category(ch)[0] != "C")
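One caveat with that one-liner: Unicode category "C" also covers tab, newline and carriage return, so those would be stripped as well. Below is a rough sketch of a variant that keeps them, and of how it could be applied to element text before serializing. The name strip_control_chars, the keep parameter, the sample string and the "comment" element are all made up for illustration; this is not the code that actually went into etpatch.py.

```python
import unicodedata
import xml.etree.ElementTree as ET


def strip_control_chars(s, keep="\t\n\r"):
    """Drop characters whose Unicode category starts with "C",
    except the ones listed in keep (hypothetical helper)."""
    return "".join(
        ch for ch in s
        if ch in keep or unicodedata.category(ch)[0] != "C"
    )


# Made-up element text containing a NUL and a BEL character.
raw = "AIA\x00 304\x07\n"

elem = ET.Element("comment")  # hypothetical element name
elem.text = strip_control_chars(raw)

# Python 3 only: encoding="unicode" returns a str instead of bytes.
serialized = ET.tostring(elem, encoding="unicode")
print(serialized)
```

Passing keep="" gives the same behaviour as the original one-liner, which may be preferable if the output has to stay on a single line.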
Fixed in 1.11.1!
While running jpylyzer with the --nullxml option on this file:
http://sdowww.lmsal.com/sdomedia/hv_jp2kwrite/v0.8/jp2/AIA/2014/02/01/304/2014_02_01__00_11_07_13__SDO_AIA_AIA_304.jp2
With Python 2.7, some of the resulting XML elements are empty, even though in reality they do contain text (e.g. look at the bottom of the output). With Python 3.3 it works correctly, so something goes wrong while parsing the XML. This could be a bug in ElementTree.
This also affects the Windows executables / Debian packages, since they are built using Python 2.7.