fiatjaf / svelte-asciidoc

AsciiDoc parser to customizable Svelte components
MIT License

Special characters render as code #2

Closed (SilberWitch closed 1 month ago)

SilberWitch commented 3 months ago

I've noticed on a few wiki pages that some characters render as code, like &#etc.

Example at the bottom of this one or this one.

SilberWitch commented 2 months ago

It's the apostrophes. All apostrophes come out as a raw character reference, e.g. attestor&#8217;s

SilberWitch commented 1 month ago

Also, ellipses like "..."

fiatjaf commented 1 month ago

Thank you for the reports. Please let me know if you find more occurrences. This seems to be a bug in the Asciidoc parser: it escapes some characters in seemingly random instances, without following any reasonable pattern. So the solution was to track down these occurrences and patch them.
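For anyone following along, the "track down and patch" approach could look roughly like the sketch below. This is not the actual fix in svelte-asciidoc; `KNOWN_ESCAPES` and `patchKnownEscapes` are hypothetical names, and the two entries are only assumed from the apostrophe and ellipsis reports above.

```typescript
// Hypothetical patch table: escaped sequence seen in the output -> intended character.
// The entries are assumptions based on the reports in this thread.
const KNOWN_ESCAPES: Record<string, string> = {
  "&#8217;": "\u2019", // right single quotation mark (typographic apostrophe)
  "&#8230;": "\u2026", // horizontal ellipsis
};

// Replace every known escaped sequence with the character it stands for.
// Anything not listed in the table is left untouched.
function patchKnownEscapes(text: string): string {
  let out = text;
  for (const [escaped, char] of Object.entries(KNOWN_ESCAPES)) {
    out = out.split(escaped).join(char);
  }
  return out;
}
```

The downside of a table like this is that every newly reported sequence needs its own entry.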

SilberWitch commented 1 month ago

Looks good, thanks! 🫂

SilberWitch commented 4 weeks ago

I found some more:

&#8201 &#8212

Maybe find some way to look for "&#"?

From https://wikistr.com/gc-alexandria*dd664d5e4016433a8cd69f005ae1480804351789b59de5af06276de65633d319
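A generic way to "look for &#" might be a single regular expression that decodes any numeric character reference, instead of patching sequences one by one. The sketch below is only an illustration: `decodeNumericReferences` is a hypothetical helper, not part of svelte-asciidoc, and treating the trailing semicolon as optional is an assumption based on the `&#8201 &#8212` report above.

```typescript
// Decode numeric character references such as "&#8217;" or "&#x2019;".
// The trailing semicolon is treated as optional (an assumption, see above).
function decodeNumericReferences(text: string): string {
  return text.replace(
    /&#(?:[xX]([0-9a-fA-F]+)|([0-9]+));?/g,
    (match: string, hex: string | undefined, dec: string | undefined) => {
      const codePoint = hex !== undefined ? parseInt(hex, 16) : parseInt(dec as string, 10);
      // Leave the text as-is for NUL, surrogates, and out-of-range values.
      const invalid =
        codePoint === 0 ||
        codePoint > 0x10ffff ||
        (codePoint >= 0xd800 && codePoint <= 0xdfff);
      return invalid ? match : String.fromCodePoint(codePoint);
    }
  );
}
```

This is only safe for strings that end up in text nodes. If the result were inserted as raw HTML, a decoded `&#60;` would become a literal `<`.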

fiatjaf commented 4 weeks ago

Thank you. It's so weird. That was supposed to be a --, but it gets transformed into an em dash surrounded by thin spaces (the &#8212 and &#8201 you found) and then garbled.

I tried a bunch of other characters from here: https://www.w3.org/wiki/Common_HTML_entities_used_for_typography, but nothing else seems to cause problems.

SilberWitch commented 3 weeks ago

I had the same problem while developing my own client, which downloads an RSS feed. 🤷🏻‍♀️ The content comes in with these weird characters.