HTMLRenderer: Revise usage and implementation of `html_escape()`

miyuchina / mistletoe

A fast, extensible and spec-compliant Markdown parser in pure Python.

MIT License


The implementation of escape_html() seems a bit inefficient, and it also escapes " where that is not actually necessary, i.e. in text outside of an attribute value.

Here is its source code:

@staticmethod
def escape_html(raw):
    return html.escape(html.unescape(raw)).replace('&#x27;', "'")
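To make the behaviour concrete, here is a small sketch that copies the method above into a standalone function (the standalone form and the sample string are just for illustration) and shows that " ends up escaped even in plain text, while ' is restored by the replace() call:

```python
import html


def escape_html(raw):
    # Same body as the method quoted above, as a plain function for illustration.
    return html.escape(html.unescape(raw)).replace('&#x27;', "'")


# html.escape() defaults to quote=True, so " becomes &quot;.
# ' is first escaped to &#x27; and then turned back into ' by replace().
print(escape_html('say "hi" & it\'s'))  # say &quot;hi&quot; &amp; it's
```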

I think html.escape()'s boolean parameter quote should probably be used instead of the call to replace(): set quote to False when escaping text outside of an attribute value, and to True otherwise. The rendered result would change in the latter case, i.e. ' would be escaped as &#x27; inside attribute values, but that shouldn't matter, or should it?
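A rough sketch of what I have in mind, assuming the escaping is split into two helpers. The names escape_text and escape_attribute are hypothetical and not part of mistletoe, and I have kept the html.unescape() call from the current code so that only the quote handling changes:

```python
import html


def escape_text(raw: str) -> str:
    """Hypothetical helper: escape text that appears outside an attribute value."""
    # quote=False escapes only &, < and >; " and ' pass through unchanged.
    return html.escape(html.unescape(raw), quote=False)


def escape_attribute(raw: str) -> str:
    """Hypothetical helper: escape text placed inside a quoted attribute value."""
    # quote=True additionally escapes " as &quot; and ' as &#x27;.
    return html.escape(html.unescape(raw), quote=True)


print(escape_text('a < b & "c"'))      # a &lt; b &amp; "c"
print(escape_attribute('it\'s "x"'))   # it&#x27;s &quot;x&quot;
```

With this split the replace() call is no longer needed, and the only visible change is that ' inside attribute values is rendered as &#x27;.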

HTMLRenderer: Revise usage and implementation of `html_escape()` #115