Here is a tentative fix for the unsoundness issue #156.
In short: we accept not only UTF8 continuation bytes as printable in is_printable_str, but also bytes that mark beginnings of multi-byte sequences where the state change table let us down and failed to produce Utf8Begin.
Here is a tentative fix for the unsoundness issue #156.
In short: we accept not only UTF8 continuation bytes as printable in
is_printable_str
, but also bytes that mark beginnings of multi-byte sequences where the state change table let us down and failed to produce Utf8Begin.