XLSX stores all unique strings used in the sheets in a SharedStringTable. When the ratio "number of string cells" / "number of unique strings" is high enough, we could create Dictionary Vectors instead of Flat Vectors.
Open questions:
[ ] How do we determine this ratio per column? One column may contain many unique strings while another contains only a few, so the first should be a Flat Vector and the second a Dictionary Vector.
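
One possible way to approach the per-column question is sketched below, assuming each string cell is already available as its index into the SharedStringTable (with `None` for non-string cells). The function name `choose_vector_kind` and the threshold `min_ratio=4.0` are hypothetical, chosen only for illustration; a real cutoff would have to be found by benchmarking.

```python
from typing import Optional, Sequence


def choose_vector_kind(
    cell_indices: Sequence[Optional[int]],
    min_ratio: float = 4.0,  # assumed threshold, not a measured value
) -> str:
    """Pick "dictionary" or "flat" for one column.

    cell_indices holds, per cell, the shared-string index of that cell,
    or None if the cell is not a string cell.
    """
    # Only string cells take part in the ratio.
    string_cells = [i for i in cell_indices if i is not None]
    if not string_cells:
        return "flat"

    # ratio = "number of string cells" / "number of unique strings"
    ratio = len(string_cells) / len(set(string_cells))
    return "dictionary" if ratio >= min_ratio else "flat"


# A column with few distinct values that repeat often.
low_cardinality = [0, 0, 0, 0, 1, 1, 1, 1]
# A column where every cell holds a different string.
high_cardinality = [0, 1, 2, 3]

print(choose_vector_kind(low_cardinality))
print(choose_vector_kind(high_cardinality))
```

This version scans the whole column. Deciding from only a sample of the rows would be cheaper, but whether a sample is representative enough remains part of the open question.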