Closed GoogleCodeExporter closed 8 years ago
Pandoc is working as it is supposed to here:
When reading Markdown or HTML, it converts all entities to unicode characters.
When writing HTML, it converts these characters to entities as needed:
<>"& are escaped; for everything else, UTF-8 is used. (As of r540, nonbreaking
spaces are also escaped as entities in HTML output.) When writing Markdown,
Pandoc uses UTF-8 for everything, using backslash-escapes when necessary.
In this respect, Pandoc behaves differently from Markdown.pl, which just
leaves entities alone. One reason for this difference is that Pandoc must
handle
LaTeX output, and entities are meaningless in LaTeX.
Original comment by fiddloso...@gmail.com
on 17 Feb 2007 at 4:04
I can see why having unicode nonbreaking spaces in the Markdown output is
problematic. As of r541, the Markdown writer uses " " for nonbreaking
spaces.
Original comment by fiddloso...@gmail.com
on 17 Feb 2007 at 5:00
Original issue reported on code.google.com by
bart.mas...@gmail.com
on 25 Jan 2007 at 7:31