JavaScript uses surrogate pairs of UTF-16 characters to represent characters with code points higher than 65535. HTML technically doesn't like FACE WITHOUT MOUTH U+1F636 encoded as ��, but some browsers seem to be tolerant of this. Ideally, our HTML-escaper would convert this to 😶.
NB, surrogate-pairs are currently white-listed by the filter (as of #14)
JavaScript uses surrogate pairs of UTF-16 characters to represent characters with code points higher than 65535. HTML technically doesn't like
FACE WITHOUT MOUTH
U+1F636 encoded as��
, but some browsers seem to be tolerant of this. Ideally, our HTML-escaper would convert this to😶
.NB, surrogate-pairs are currently white-listed by the filter (as of #14)