Closed Numpsy closed 5 years ago
@Numpsy, I've fixed this issue. It's a very strange behaviour of the CodePage value (a signed short). Its max value should be 32767, but reinterpreting it as an unsigned 16-bit value we find 65001. I still haven't found specific documentation for this issue, but I'm looking for it. Best Regards, Federico
It's a bit of an unfortunate situation. I've seen similar in C++ code that reads the property sets using the Windows native compound document APIs, and that just casts the value and/or interprets it as a different type to get the correct result.
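For anyone following along, here is a minimal sketch of the reinterpretation being discussed, written in Python rather than the library's own language. It assumes the code page is stored as a 2-byte little-endian field; `normalize_code_page` is a hypothetical helper for illustration and is not part of OpenMcdf or the Windows APIs.

```python
import struct

CP_UTF8 = 65001  # Windows code page identifier for UTF-8

# Assumption: the code page occupies a 2-byte little-endian field.
raw = struct.pack("<H", CP_UTF8)

# Reading those two bytes as a signed 16-bit integer gives a negative number.
as_signed = struct.unpack("<h", raw)[0]

# Reading the same two bytes as an unsigned 16-bit integer gives 65001 back.
as_unsigned = struct.unpack("<H", raw)[0]


def normalize_code_page(value: int) -> int:
    """Hypothetical helper: map a signed 16-bit reading into the range 0..65535."""
    return value & 0xFFFF


print(as_signed)                       # -535
print(as_unsigned)                     # 65001
print(normalize_code_page(as_signed))  # 65001
```

Masking with `0xFFFF` leaves ordinary code pages such as 1252 unchanged, so applying it unconditionally should be harmless under the assumption above.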
All I can really say is that Windows Explorer (in Win10 at least) seems to set the code page to UTF-8 when you change the file properties through it, and I assume it knows what it's doing.
Hi,
I was trying to do a test of reading properties from a Word document using the openmcdf extensions rather than native functions, and got a
with the callstack
when calling AsOLEProperties on a SummaryInformation stream which I believe has a codepage of UTF-8. I haven't looked at it too deeply, but I've seen situations in the past where the codepage of 65001 gets interpreted as a negative number, so I'm wondering if that's what is happening here?
Thanks.
english.presets.zip