[X] Bug fix (involves code and configuration changes)
About
In some cases a binary string may pass as valid UTF-8 to the mb_check_encoding(..., 'UTF-8') function. Use a comprehensive regexp from the W3 group instead to be certain we aren't trying to parse binary content in formatContent(). In addition to (strings), also check for the beginning of ID inline image content sections, which may also contain binary. Resolves #668.
In case you changed the code/configuration, please read each of the following checkboxes as they contain valuable information:
[X] Please add at least one test case (unit test, system test, ...) to demonstrate that the change is working. If existing code was changed, your tests cover these code parts as well.
Type of pull request
About
In some cases a binary string may pass as valid UTF-8 to the
mb_check_encoding(..., 'UTF-8')
function. Use a comprehensive regexp from the W3 group instead to be certain we aren't trying to parse binary content informatContent()
. In addition to(strings)
, also check for the beginning ofID
inline image content sections, which may also contain binary. Resolves #668.Reference: https://www.w3.org/International/questions/qa-forms-utf-8.en
Checklist for code / configuration changes
In case you changed the code/configuration, please read each of the following checkboxes as they contain valuable information:
fixes #1234
to outline that you are providing a fix for the issue#1234
.