FasterXML / aalto-xml

Ultra-high performance non-blocking XML processor (Stax API + extensions)
Apache License 2.0
292 stars 70 forks source link

Async parsing turns " inside element content into apostrophe #78

Closed jphelp32 closed 2 years ago

jphelp32 commented 2 years ago

This issue does not occur in 1.2.2. It is present in 1.3.1. I'm told it's also present in 1.3.0, but I haven't confirmed that myself.

I think I can see the code that's doing this, and that it's been in place for a long time. Wondering if maybe other fixes in 1.3.x have caused this long-standing bug to surface where before it was hidden?

See:

Also two locations in AsyncByteArrayScanner.java

jphelp32 commented 2 years ago

@cowtowncoder any thoughts on this? if accurate, it seems like a pretty severe issue. Could a fix be released in the near term?

cowtowncoder commented 2 years ago

@jphelp32 Yes, if someone provides a patch it could definitely be released. I do not have time to work on Aalto right now but can always help others with PRs.

cowtowncoder commented 2 years ago

Ok, I managed to find some time to work on this: fix itself is not a problem (trivial copy-paste problem) but just had to go and find a test to modify to verify goodness of fix for both input sources (byte array and ByteBuffer backed).

I'll release 1.3.2 soon, maybe today, with this fix. Thank you again for reporting the issue @jphelp32 !

jphelp32 commented 2 years ago

Thanks @cowtowncoder !