Escaped `<` characters (`<`) are processed incorrectly

This is a more specific follow-up to #182.

When the < escape sequence is processed, it is incorrectly converted to &LT instead of kept as-is:

>>> import minify_html
>>> print(minify_html_onepass.minify("&lt;"))
<

>>> print(minify_html_onepass.minify("&lt;faketag"))
&LTfaketag

>>> print(minify_html_onepass.minify("&lt;faketag&gt;"))
&LTfaketag>

Strangely, a bare < by itself is processed correctly. It is only when followed by content that it breaks.

The issue occurs in both minify_html and minify_html_onepass.

We are able to work around it as follows:

html = html.replace("&lt;", "AMP_LT_WORKAROUND")
html_minified = minify_html.minify(html)
html = html.replace("AMP_LT_WORKAROUND", "&lt;")

but a proper fix would be better (and more efficient, as we process tens of thousands of HTML files at a time).

wilsonzlin / minify-html

Escaped `<` characters (`<`) are processed incorrectly #191

wilsonzlin / minify-html

Escaped `<` characters (`&lt;`) are processed incorrectly #191

Escaped `<` characters (`<`) are processed incorrectly #191