felipenoris / XLSX.jl

Excel file reader and writer for the Julia language.
https://felipenoris.github.io/XLSX.jl/stable
Other
275 stars 58 forks source link

Failed to parse internal XML file `xl/comments95.xml` #253

Open GeoffRussell opened 9 months ago

GeoffRussell commented 9 months ago

I've downloaded a bunch (649) of xlsx files from the UNFCCC ... National Greenhouse gas inventories. While XLSX parses most of them successfully, there are a few with the following error ...

ERROR: LoadError: XMLError: Document is empty from XML parser (code: 4, line: 1)

See attached file ...

HUN_2023_2008_15042023_034852.xlsx

I certainly don't need the "internal XML file", so some way of ignoring them would be enough of a workaround for me.

Many thanks

GeoffRussell commented 9 months ago

Further to this issue ... I can't read 203 of the 649 files.

GeoffRussell commented 9 months ago

I'm slowly learning about these xlsx files ... from a position of total ignorance. It seems like the problems are due to some of these so-called comments files being empty. Why should an empty file be considered a showstopper?

GeoffRussell commented 9 months ago

If I put some properly formatted xml in the comments file, the problem vanishes. But there can be multiple empty comments file in each xlsx file and I can't work out a method of automating the patching process.