mbleron / ExcelTable

ExcelTable is an Oracle SQL interface to read Microsoft Excel files (.xlsx, .xlsm, .xlsb, .xls, .xml), ODF spreadsheet files (.ods) and flat files as external tables
MIT License
45 stars 12 forks source link

Read control characters stored in Unicode escaped form #46

Open mbleron opened 1 year ago

mbleron commented 1 year ago

As per ECMA-376 §22.9.2.19, Excel strings are stored as ST_Xstring which supports escaped invalid XML characters, i.e. control characters [0x00-0x1F] minus 0x09, 0x0A and 0x0D. ExcelTable does not escape back such characters when reading string cells.